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Abstract 



We are interested in the asymptotic behavior of critical Galton- Watson trees whose 
offspring distribution may have infinite variance, which are conditioned on having a large 
fixed number of leaves. We first find an asymptotic estimate for the probability of a Galton- 
Watson tree having n leaves. Secondly, we let in be a critical Galton- Watson tree whose 
offspring distribution is in the domain of attraction of a stable law, and conditioned on 
having exactly n leaves. We show that the rescaled Lukasiewicz path and contour function 
of in converge respectively to A" exc and H exc , where X exc is the normalized excursion of a 
strictly stable spectrally positive Levy process and H exc is its associated continuous-time 
height function. As an application, we investigate the distribution of the maximum degree 
in a critical Galton- Watson tree conditioned on having a large number of leaves. We also 
explain how these results can be generalized to the case of Galton- Watson trees which are 
conditioned on having a large fixed number of vertices with degree in a given set, thus 
extending results obtained by Aldous, Duquesne and Rizzolo. 

Keywords. Random trees, invariance principles, scaling limits, conditioned Galton- 
Watson trees, Stable trees. 
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Introduction 

In this article, we are interested in the asymptotic behavior of critical Galton- Watson trees whose 
offspring distribution may have infinite variance, and which are conditioned on having a large 
fixed number of vertices with degree in a given set. We focus in particular on Galton- Watson 
trees conditioned on having a large fixed number of leaves. Aldous [U [2] studied the shape of large 
critical Galton- Watson trees whose offspring distribution has finite variance, under the condition 
that the total progeny is equal to n. Aldous' result has then been extended to the infinite variance 
case (see e.g. \12\ I13|). In a different but related direction, the effect of conditioning a Galton- 
Watson tree on having height equal to n has been studied [161 CHI E2], an( l Broutin & Marckert [6j 
have investigated the asymptotic behavior of uniformly distributed trees with prescribed degree 
sequence. In [21], we introduced a new type of conditioning involving the number of leaves of 
the tree in order to study a specific discrete probabilistic model, namely dissections of a regular 
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polygon with Boltzmann weights. The results contained in the present article are important 
for understanding the asymptotic behavior of the latter model (see |9l EI])- The more general 
conditioning on having a fixed number of vertices with degree in a given set has been considered 
very recently by Rizzolo |31j . The results of the present work were obtained independently of 
|31j (see the end of this introduction for a discussion of the relation between the present work 
and [M]). 

Before stating our main results, let us introduce some notation. If fi is a probability distri- 
bution on the nonnegative integers, P M will be the law of the Galton- Watson tree with offspring 
distribution \x (in short the GW M tree). Let £(r) be the total number of vertices of a tree r and 
let A(r) be its number of leaves, that is the number of individuals of r without children. Let A be 
a non-empty subset of N = {0, 1,2,.. .}. If r is a tree, denote the number of vertices u G r such 
that the number of children of u is in A by Ca{ t )- Note that Cn( t ) — C( r ) an d C{o}( r ) — -M r )- 



We now introduce three different coding functions which determine r (see Definition 1.3 
for details). Let u(0),u(l), . . . u(((t) — 1) denote the vertices of r in lexicographical order. 
The Lukasiewicz path W(r) = (W n (r),0 < n < C( r )) is defined by W (r) = and for < 
n < C( r ) — 1) W n+1 (r) = W„(r) + k n — 1, where k n is the number of children of u(n). For 
< i < C(r) — 1, define Hi(r) as the generation of u(i) and set H^^(r) = 0. The height function 
H{ T ) = {Ht{T~);0 < t < C( r )) is then defined by linear interpolation. To define the contour 
function (Ct(r),0 < t < 2£(r)), imagine a particle that explores the tree from the left to the 
right, moving at unit speed along the edges. Then, for < t < 2(£(r) — 1), Ct(r) is defined 
as the distance to the root of the position of the particle at time t and we set C t (r) = for 
t G [2(C(r) - 1),2C(t)]. See Fig. |l]and|2|for an example. 

Let 9 G (l, 2] be a fixed parameter and let (X t )t>o be the spectrally positive Levy process with 
Laplace exponent E[exp(— XX t )] = exp(t\ 9 ). Let also pi be the density of X\. For 6 = 2, note 
that X is a constant times standard Brownian motion. Let X exc = (X t exc )o<t<i be the normalized 
excursion of X and H exc = (i? t exc ) <t<i its associated continuous-time height function (see Section 



5.1 for precise definitions). Note that H exc is a random continuous function on [0, 1] that vanishes 
at and at 1 and takes positive values on (0, 1), which codes the so-called ^-stable random tree 

(see nm 



We now state our main results. Fix 9 G (1, 2]. Let /i be an aperiodic probability distribution 
on the nonnegative integers. Assume that is critical (the mean of fi is 1) and belongs to the 
domain of attraction of a stable law of index 9 e (1, 2]. 

(I) Let d > 1 be the largest integer such that there exists b G N such that supp(|u)\{0} is 
contained in b + cfZ, where supp(/i) is the support of fi. Then there exists a slowly varying 
function h such that: 

P JX(r) = n) ~ M(0) 1/g Pi(0) g ? d ^~ / 1 'f ) 
ML w J n^oo ^ > FiK 1 h(n)n l / e+1 

for those values of n such that P M [A(r) = n) > 0. Here we write a n ~ b n if a n /b n — > 1 as 
n — t- oo. 

(II) For every n > 1 such that P^ [A(r) = n] > 0, let t n be a random tree distributed according to 
P/u[' I -^( r ) = n ]- Then there exists a sequence of positive real numbers (-B n ) n >i converging 
to oo such that 
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converges in distribution to (X exc , H exc , H exc ) as n — > oo. 
At the end of this work, we explain how to extend (I) and (II) when the condition "A(r) = n 



is replaced by the more general condition "d(r) = n" (see Theorem 8.1). However, we shall 
give detailed arguments only in the case of a fixed number of leaves. This particular case is less 
technical and suffices in view of applications to the study of random dissections. 

We now explain the main steps and techniques used to establish (I) and (II) when A = {0}. 
Let v be the probability measure on Z defined by vik) = fi(k + 1) for k > — 1. Our starting 
point is a well-known relation between the Lukasiewicz path of a GW M tree and an associated 
random walk. Let (W n ; n > 0) be a random walk started at with jump distribution v and 
set C = inf{n > 0; W n = — 1}. Then the Lukasiewicz path of a GW M tree has the same law as 
(Wo, W\, . . . , Wf). Consequently, the total number of leaves of a GW M tree has the same law as 
J2k=i l{Wfc-Wfc_i=-i}- By noticing that this last sum involves independent identically distributed 
Bernoulli variables of parameter /i(0), large deviations techniques give: 



P 



A(t) = n and 



>C(r 



,3/4 



< e 



-Cy/n 



for some c > 0. This roughly says that a GW^ tree with n leaves has approximately n/fi(0) 
vertices with high probability. Since GW M trees conditioned on their total progeny are well 
known, this will allow us to study GW M trees conditioned on their number of leaves. 

Let us now explain how an asymptotic estimate for P M [A(r) = n] can be derived. Define A(n) 

by: 

A(n) = Card{0 < i < n - 1; W i+1 -W { = -1}. 

The crucial step consists in noticing that for n,p > 1, the distribution of (Wo, W\, . . . , W p ) under 
the conditional probability measure P[- \W P = — 1, A(p) = n] is cyclically exchangeable. The 
so-called Cyclic Lemma and the relation between the Lukasiewicz path of a GW M tree and the 



random walk W easily lead to the following identity (Proposition 1.6): 



P M [C(r) = p, \(t) = n] = i P[A(p) = n , W p = -1] = i P[5 P = n] ¥[W' p _ n = n - 1], (2) 

where 5p is the sum of p independent Bernoulli random variables of parameter /x(0) and W is 
the random walk W conditioned on having nonnegative jumps. From the concentration result 
([T]) and using extensively a suitable local limit theorem, we deduce the asymptotic estimate (I). 

The proof of (II) is more elaborate. The first step consists in proving the convergence on 
every interval [0, a] with a G (0,1). To this end, using the large deviation bound ([!]), we first 
prove an analog of (II) when t n is a tree distributed according to P jU [ ■ |A(r) > n]. We then use an 
absolute continuity relation between the conditional probability measure P M [ • | A(r) = n] and the 
conditional probability measure P M [- | A(r) > n] to get the desired convergence on every interval 
[0, a] with a G (0, 1). The second step is to extend this convergence to the whole interval [0, 1] 
via a tightness argument based on a time-reversal property. In the case of the Lukasiewicz path, 
an additional argument using the Vervaat transformation is needed. 
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As an application of these techniques, we study the distribution of the maximum degree in a 
Galton- Watson tree conditioned on having many leaves. More precisely, if r is a tree, let A(r) 
be the maximum number of children of a vertex of r. Let also A(X exc ) be the largest jump of 
the cadlag process X exc . Set D(ri) = max{fc > 1; fi([k, oo)) > For every n > 1 such that 

P^ [A(r) = n] > 0, let t n be a random tree distributed according to P M [ ■ | A(r) = n]. Then, under 
assumptions on the asymptotic behavior of the sequence (/i(^) 1//n ) n >i i n the finite variance case 



(see Theorem 7. 1 ) 



(i) If the variance of \x is infinite, then /i(O) 1/0 A(t n )/ J B n converges in distribution towards 

A(X exc ). 

(ii) If the variance of /x is finite, then A(t n ) /D(n) converges in probability towards 1. 

The second case yields an interesting application to the maximum face degree in a large uniform 
dissection (see [S]). Let us mention that using generating functions and saddle-point techniques, 
similar results have been obtained by Meir and Moon [26] when t n is distributed according to 
P M [- | C( r ) = ti]. Our approach can be adapted to give a probabilistic proof of their result. 

We now discuss the connections of the present article with earlier work. Using different 
arguments, formula ^ has been obtained in a different form by Kolchin jTS]. The asymptotic 
behavior of P M [C/i( r ) = n ] has been studied in |27l EH [29] when Card(.A) = 1 and the second 
moment of /j is finite. Absolute continuity arguments have often been used to derive invariance 
principles for random trees and forests, see e.g. jU, [121 EH 12*2"] . 

Let us now discuss the relationship between the present work and Rizzolo's recent article 
|31| . which deals with similar conditionings of random trees. The main result of |31| considers a 
random tree distributed according to P M [- | Ca(t) = n], where it is assumed that 6 A. In the 
finite variance case, [31] gives the convergence in distribution in the rooted Gromov-Hausdorff- 
Prokhorov sense of the (suitably rescaled) tree t n viewed as a (rooted) metric space for the graph 
distance towards the Brownian CRT. Note that the convergence of the contour functions in (II), 



together with Corollary 3.3, does imply the Gromov-Hausdorff-Prokhorov convergence of trees 
viewed as metric spaces, but the converse is not true. Furthermore our results also apply to the 
infinite variance case and include the case where ^ A. 

The paper is organized as follows. In Section 1, we present the discrete framework and 
we define Galton- Watson trees and their codings. We prove ^ and explain how the local 
limit theorem gives information on the asymptotic behavior of large GW M trees. In Section 2, 
we present a law of large numbers for the number of leaves, which leads to the concentration 
formula Q. In Section 3, we prove (I). In Section 4, we establish an invariance principle under 
the conditional probability P M [ ■ | A(r) > n]. In Sections 5 and 6, we refine this result by obtaining 
an invariance principle under the conditional probability P M [- 1 A(r) = n], thus proving (II). As an 
application, we study in Section 7 the distribution of the maximum degree in a Galton- Watson 
tree conditioned on having many leaves. Finally, in Section 8, we explain how the techniques 
used to deal with the case A = {0} can be extended to general sets A. 

Acknowledgements. I am deeply indebted to Jean-Frangois Le Gall for enlightening dis- 
cussions and for many suggestions on the earlier versions of this work. I also thank Louigi 
Addario-Berry for a useful discussion concerning the case where both 8 = 2 and /i has infinite 
variance, and Douglas Rizzolo for remarks on this work. 
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Notation and assumptions. Throughout this work 9 e (1,2] will be a fixed parameter. 
We say that a probability distribution (/J*(j))j>o on the nonnegative integers satisfies hypothesis 
(Hg) if the following three conditions hold: 

(i) n is critical, meaning that X^o^M^) = 1; an d Ml) < 1- 

(ii) /i is in the domain of attraction of a stable law of index 9 e (1, 2}. This means that either 
the variance of /i is finite, or /i([j, oo)) = j~ L(j), where L : IR + — > R + is a function such 
that lim^.-s.oo L(tx)/L(x) = 1 for all t > (such a function is called slowly varying). We 
refer to [5] or [T4"] chapter 3.7] for details. 
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(iii) /i is aperiodic, which means that the additive subgroup of the integers Z spanned by 
{j; 7^ 0} is not a proper subgroup of Z. 

We introduce condition (iii) to avoid unnecessary complications, but our results can be extended 
to the periodic case. 

Throughout this text, v will stand for the probability measure defined by u(k) = p(k + 1) for 
k > — 1. Note that v has zero mean. To simplify notation, we write po instead of p(0). Note 
that p > under (Ho). 

1 The discrete setting : Galton- Watson trees 
1.1 Galton- Watson trees 

Definition 1.1. Let N = {0, 1, . . .} be the set of all nonnegative integers, N* = {1,2,...} and U 
the set of labels: 

oo 

U = |J (N*) n , 

n=0 

where by convention (N*)° = {0}. An element of U is a sequence u = U\---u m of positive 
integers, and we set \u\ = m, which represents the "generation" of u. If u — u± • • • u m and 
v — v 1 • • • v n belong to U, we write uv — U\ • • • u m V\ • • • v n for the concatenation of u and v. In 
particular, note that «0 = 0m = u. Finally, a rooted ordered tree r is a finite subset of U such 
that: 

1. G r, 

2. if v e r and v = uj for some j G N*, then u G r, 

3. for every u G r, there exists an integer k u (r) > such that, for every j G N*, uj G r if and 
only if 1 < j < k u (r). 

In the following, by tree we will always mean rooted ordered tree. We denote by the set of all 
trees by T. We will often view each vertex of a tree r as an individual of a population whose r 
is the genealogical tree. The total progeny of r will be denoted by £(t) = Card(r). A leaf of a 
tree r is a vertex u G r such that k u (r) = 0. The total number of leaves of r will be denoted by 
A(t). If r is a tree and u G r, we define the shift of r at u by T u r = {v G U; uv G r}, which is 
itself a tree. 

Definition 1.2. Let p be a probability measure on N with mean less than or equal to 1 and, 
to avoid trivialities, such that p(l) < 1. The law of the Galton- Watson tree with offspring 
distribution p is the unique probability measure P p on T such that: 

1. P P (A;0 = j) = p(j) for j > 0, 

2. for every j > 1 with p(j) > 0, the shifted trees Tit, . . . ,TjT are independent under the 
conditional probability P p ( • | k$ — j) and their conditional distribution is P p . 

A random tree whose distribution is P p will be called a Galton- Watson tree with offspring dis- 
tribution p, or in short a GW p tree. 
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In the sequel, for an integer j > 1, P w - will stand for the probability measure on TP which 
is the distribution of j independent GW^ trees. The canonical element of TP will be denoted 
by f. For f = (n, . . . , Tj) E TP, set A(f) = A(n) + ■ ■ ■ + Afa) and ((f) = C(n) + • ■ • + Cfa) for 
respectively the total number of leaves of f and the total progeny of f. 

1.2 Coding Galton- Watson trees 

We now explain how trees can be coded by three different functions. These codings are crucial 
in the understanding of large Galton- Watson trees. 




10 20 30 40 



Figure 1: A tree r with its vertices indexed in lexicographical order and its contour function 
(C u (t); 0<u< 2(C(r) - 1). Here, C(r) = 26. 



Figure 2: The Lukasiewicz path W(r) and the height function H(t) of r. 



Definition 1.3. We write u < v for the lexicographical order on the labels U (for example 
< 1 < 21 < 22). Consider a tree r and order the individuals of r in lexicographical order: 
= u (o) < u (l) < ... < u(C(t) - 1). The height process H(t) = (H n (r),0 < n < C(r)) is 
defined, for < n < ((t), by: 

H n (r) = \u(n)\. 

For technical reasons, we set H^ T )(r) = 0. 

Consider a particle that starts from the root and visits continuously all edges at unit speed 
(assuming that every edge has unit length), going backwards as little as possible and respecting 
the lexicographical order of vertices. For < t < 2(£(t) — 1), C t (r) is defined as the distance 
to the root of the position of the particle at time t. For technical reasons, we set C t (r) = for 
t G [2(£(t) — 1), 2£(r)]. The function C(r) is called the contour function of the tree r. See Figure 
[Tlfor an example, and [T2| Section 2] for a rigorous definition. 
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Finally, the Lukasiewicz path W(r) = (W n (r),0 < n < ((t)) of r is defined by W (r) = 
and for < n < £(r) — 1: 

W n+1 (r) = W n (r) + k u(n) (r) - 1. 
Note that necessarily W^( T )(r) = — 1. 

A forest is a finite or infinite ordered sequence of trees. The Lukasiewicz path of a forest is 
defined as the concatenation of the Lukasiewicz paths of the trees it contains (the word "concate- 
nation" should be understood in the appropriate manner, see [12] Section 2] for a more precise 
definition). The following proposition explains the importance of the Lukasiewicz path. 

Proposition 1.4. Fix an integer j > 1. Let (W n ; n > 0) be a random walk which starts at 
with jump distribution u(k) = fi(k + 1) for k > —1. Define Q = inf{n > 0; W n = —j}. Then 
(Wq, Wi, . . . , W$.) is distributed as the Lukasiewicz path of a forest of j independent GW^ trees. 
In particular, the total progeny of j independent GW M trees has the same law as (j . 

Proof. See |23| Proposition 1.5]. □ 

Note that the previous proposition applied with j = 1 entails that the Lukasiewicz path of 
a Galton- Watson tree is distributed as the random walk W stopped when it hits —1 for the 
first time. We conclude this subsection by giving a link between the height function and the 
Lukasiewicz path (see [2Hj Prop. 1.2] for a proof). 

Proposition 1.5. Let t be a tree. Then, for every < n < C( r )-' 

H n (r) = Card (o < j < n; W,(r) = inf W k {r)\ . (3) 



1.3 The Cyclic Lemma 

We now state the Cyclic Lemma which is crucial in the derivation of the joint law of (C(r), A(r)) 
under P^. For integers 1 < j ' < p, define: 

= {(x u ...,x p )e {-1, 0, 1,2,.. .f; £ Xi = -j} 

i=l 

and 

— ( -1 m 

§; j = {(x 1; • • • , x p ) G Sj»; ]>> > —j for all m e {0, 1, . . . ,p - 1}}. 



i=i 



For x = (xi, . . . , x p ) G §>p^ and i G Z/pZ, denote by the i-th cyclic shift of x defined by 



i+k mod p for 1 < k < p. For x G §W, finally set: 



fc ^i+k mod p 



i G Z/pZ; x (4) G S 



(0 c 



V 



The so-called Cyclic Lemma states that we have Card(J x ) = j for every x G (see [30l Lemma 
6.1] for a proof). 

Let (VK n ;n > 0) and ^- be as in Proposition 1.4 Define A(k) by A(fe) = Card{0 < % < 
k — 1; Wj+i — Wi = —1}. Let finally n,p > 1 be positive integers. From the Cyclic Lemma and 
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the fact that for all k £ Z/pZ one has Card{l < i < p; Xi = — 1} = Card{l < i < p\ xf^ = — 1}, 
it is a simple matter to deduce that: 



F[Q = p, A(p) = n] = 3 -¥[W p = -j, A(p) = n]. 



(4) 



See e.g. |30| Section 6.1] for similar arguments. Note in particular that we have F[Q — p] — 
j¥\W p = —j]/p. This result allows us to derive the joint law of (C( r ) ; -M r )) under P M : 

Proposition 1.6. Let j and n < p be positive integers. We have: 

^A((f) = V, Kf) = n] = J p nS p = n] F[W;_ n =n-j}. 

where S p is the sum of p independent Bernoulli random variables of parameter fi and W is the 
random walk started from with nonnegative jumps distributed according to r](i) = fi(i + 1)/(1 — 
/io) for every i > 0. 

Proof. Using Proposition 1.4 and Q , write P Mi j[£(f) = p, A(f) — n] — j¥[A(p) = n, W p = —j]/p. 
To simplify notation, set Xi = W{ — Wi-\ for i > 1 and note that: 



F[A(p) = n,W p = -j] 



F[X i = -l,Vie{i 1 ,...,i n }] 

l<i\<--<i n <p 



¥ J2 X i = n-j- X, , -1. V/./ {/ ; i,.} 

The last probability is equal to P [Wl_ n = n — j] P [X ; > — 1, Vi {ii, . . . , i n }] and it follows 
that: 

P [A(p) = n, W p = -j] = P [W;_ n = n-j]V [S p = n] , (5) 

giving the desired result. □ 



1.4 Slowly varying functions 

Slowly varying functions appear in the study of domains of attractions of stable laws. Here we 
recall some properties of these functions in view of future use. 

Recall that a nonnegative measurable function L : R + — > IR + is said to be slowly varying if, 
for every t > 0, L(tx)/L(x) — > 1 as x — > oo. A useful result concerning these functions is the 
so-called Representation Theorem, which states that a function L : IR + — > M + is slowly varying 
if and only if it can be written in the form: 

J duj , x > 0, 

where c is a nonnegative measurable function having a finite positive limit at infinity and e is 
a measurable function tending to at infinity. See e.g. [5, Theorem 1.3.1] for a proof. The 
following result is then an easy consequence. 

Proposition 1.7. Fix e > and let L : IR + — > K + be a slowly varying function, 
(i) We have x e L(x) — > oo and x~ e L(x) — > as x — )■ oo. 

(ii) There exists a constant C > 1 snc/i that ^x~ e < L(nx)/L(n) < Cx e for every integer n 
sufficiently large and x > 1 . 
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1.5 The Local Limit Theorem 



Definition 1.8. A subset A C Z is said to be lattice if there exist b G Z and d > 2 such that 
A C 6 + c/Z. The largest d for which this statement holds is called the span of A. A measure on 
Z is said to be lattice if its support is lattice, and a random variable is said to be lattice if its 
law is lattice. 

Remark 1.9. Since /i is supposed to be critical and aperiodic, using the fact that /i(0) > 0, it 
is an exercise to check that the probability measure v is non-lattice. 

Recall that (X t )t>o is the spectrally positive Levy process with Laplace exponent E[exp(— XX t )] 



exp(t\ e ) and p\ is the density of X\. When 9 = 2, we have Pi(x) = e x / 4 /y/4ff. It is well known 
that pi is positive, continuous and bounded (see e.g. |34^ I. 4]). The following theorem will allow 



us to find estimates for the probabilities appearing in Proposition 1.6 



Theorem 1.10 (Local Limit Theorem). Let (H^ n ) n >o be a random walk on Z started from 
such that its jump distribution is in the domain of attraction of a stable law of index 9 G (1, 2]. 
Assume that W\ is non-lattice and that P[Wi < -1] = 0. Set K(x) = E[W 2 l\ Wl \<x] for x > 0. 
Let a 2 be the variance of W\ and set: 



(T(l-9)) 1/0 mi\x>O; P [Wj > x] < 



n K I sup < z > 



K{z) 
' z 2 



> 



n 



if a 2 < oo, 

if a 2 = oo and 9 < 2, 
if a 2 = co and 9 = 2, 



with the convention sup0 = 0. 

(i) The random variable (W n — rtE[Wi])/a n converges in distribution towards X\. 
(ii) We have a n = n l l e L{n) where L : M + — > IR + is slowly varying. 

'Jfe-nE[Wi] 



(in) We have lim sup 



a n F[W n — k]—pi 



0. 



Proof. First note that E[|Wi|] < oo since 9 > 1 (this is a consequence of [17, Theorem 2.6.1]). 

We start with (i). The case a 2 < co is the classical central limit theorem. Now as- 
sume that a 2 = co and 9 < 2. Write G(x) = P[|Wi| > x] for x > and introduce a' n = 
inf {x > 0; G(x) < l/n}, so that a n = (r(l — 9)) 1 ^ 9 a' n for n sufficiently large. By [14, Formula 
3.7.6], we have nG(a' n ) — > 1. By definition of the domain of attraction of a stable law, there exists 
a slowly varying function L : IR + — > IR + such that G(x) = L(x)/x e . Hence G(a n ) ~ l/(nT(l — 9)). 
Next, by US Section XVII (5.21)] we have K(x) ~ x 2 G(x)9/(2 - 9) as x ->• oo. Hence: 



nK(a n ) 



n 



9 



al2-9 



a 2 l G(a n ) 



9 



(2-0)r(i 



From |T5l Section XVII. 5, Theorem 3], we now get that (W n — riE[Wi])/a n converges in distribu- 
tion to X\. Finally, in the case a 2 = co and 9 = 2, assertion (i) is a straightforward consequence 
of the proof of Theorem 2.6.2 in |17| . 



10 



We turn to the proof of (ii). By pT7] p. 46], for every integer k > 1, a^ n ja n — > k l l e asn-f oo. 
Since (a n ) is increasing, by a theorem of de Haan (see (5j Theorem 1.10.7]), this implies that 
there exists a slowly varying function L : M + — > K + such that a n = for every positive 

integer n. 

Assertion (hi) is the classical local limit theorem (see [TT] Theorem 4.2.1]). □ 



oo and 9 = 2, note that L(n) — > oo as n — >■ oo and that L can be chosen to 



In the case a 2 
be increasing. 

Assume that fi satisfies (He) for a certain 9 G (1,2]. Let (H / n ) n >o be a random walk started 
from with jump distribution v. Since /i is in the domain of attraction of a stable law of index 
9, it follows that v is also in this domain of attraction. Moreover, E [Wi] = and W\ is not 



lattice by Remark 1.9 Let a be the variance of W\ and define B n to be equal to the quantity 



a n defined in Theorem 1.10 Then, as n — > oo, W n /B n converges in distribution towards X\. In 
what follows, h : K + — > R + will stand for a slowly varying function such that B n = h(n)n 1 / 6 '. 



Lemma 1.11. We have: 
Mr] 



n 



Pi(0) 



n 



l/e+i 



h(n) ' 



P M [C(r) > n] 



l(0) 



n 1 / e /i(n) ' 



Proof. This is an easy consequence of Theorem 1.10 (iii) together with the fact that P M [<^(r 



P [W n = — 1] /n, as noticed before Proposition 1.6 



□ 



Remark 1.12. In particular, F^[((t) = n] > for n sufficiently large if \i is aperiodic. When /i 
is periodic, if d is the span of the support of /i, one can check that for n sufficiently large, one 
has [((t) = n] > if and only if n = 1 mod d. 



2 A law of large numbers for the number of leaves 

In the sequel, we fix 9 G (1,2] and consider a probability distribution (ttonN satisfying hypothesis 
(Hq). In this section, we show that if a GW /t tree has total progeny equal to n, then it has 
approximatively /i n leaves with high probability. Intuitively, this comes from the fact that each 
individual of a GW M has a probability /i of being a leaf. Conversely, we also establish that if a 
GW M tree has n leaves, than it has approximatively n//xo vertices with high probability. 



Definition 2.1. Consider a tree r G T and let (u(i), < i < C( r ) — 1) be the vertices of r 
listed in lexicographical order and denote by kj the number of children of u(j). For < s < ((t) 
define A r (s) by A T (s) = X)}=o l{fc =o} ; where [s\ stands for the integer part of s. Set also 
A T (C(r)) = A(r). 



xln^+ 1 



x) In jzj^ ■ The following 



Lemma 2.2. Let (Xj)j>i be a sequence of independent identically distributed Bernoulli random 
variables of parameter ^lq. For < x < 1, define <p*(x) 
two properties hold: 

(i) For a > and n > 1: 



1 n 

- V X k > no + a 



P 



1 n 
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(ii) We have 4>*(fio + x ) — ; x^ 2 + o(x 2 ) when x — > 0. 

2fi (l - fio) 

Proof. For the first assertion, see [TT| Remark (c) in Theorem 2.2.3]. The second one is a simple 
calculation left to the reader. □ 

Definition 2.3. Let e > 0. We say that a sequence of positive numbers (x n ) is oe e (n) if there 
exist positive constants c, C > such that x n < Ce~ cn ' for all n and we write x n = oe £ (n). 

Remark 2.4. It is easy to see that if x n = oe e (n) for some e > then the sequence (y n )n>l 
defined by y n = J2T=n x k is a l so oe e (n). 

Lemma 2.5. Fix < n < 1 and 5 > 0. 

(i) Let (W n — Xi + ■ • ■ + X n ; n > 0) be a random walk started at with jump distribution 
u(k) = n(k + 1), k > — 1 under P. Then: 



sup 

?7<t<l 



\nt\ 

l {Xj=-i} - Mo 

3=0 



> 



71 



5 



oei/ 2 ( 



n 



(ii) For those values of n such that P M [((t) = n] > we have: 



P„ 



sup 

T)<t<l 



A T (nt) 



nt 



Mo 



> 



8 



n 



1/4 



C(r) = n 



oe 1/2 (n) 



Proof. For the first assertion, define Z\. = | Z)i=o l{Xj=-i} — Mo f° r k > 1. By Lemma 2.2 
for n sufficiently large we have cjf f/j ± 5n -1 / 4 ) > cn _1//2 , for some c > 0. Since the random 
variables (l{x =-i})i>i are independent Bernoulli random variables of parameter /^o, for large n 
and > \j]n\ we have by Lemma 2.2 (i): 



F[Z k > 5n- 1/4 } < 4exp 
Therefore, for large enough n: 



k 



1/2 



n 



P 



sup 

T7<t<l 



[nt\ 



nt 



Yl kx^i} - mo 



j=0 



> 



n 



1/4 



7771 — 1 

< 4exp ( -c^y^- ) <4exp 



.^ n i/2 
2 



< P 



n x/4 



< E p 



3A; G [rm - 1, n) D N such that Z fc > — 
6 



Z k > 



n 



1/4 



< 4(1 — 77)71 exp 



^ n l/2 



which is oexjiin). 

For the second assertion, introduce ( = inf{n > 0; W n = —1} and use Proposition 1.4 which 
tells us that: 



P„ 



sup 

_v<t<i 



A T (nt) 



nt 



Ik) 



> 



n 



i/i 



C(r) = n 



P 



sup 

rj<t<l 



< 



-P 



n 



[nt\ 

E 1 {^=-i} ~ Mo 

3=0 

\nt\ 



> 



n 



1/4 



sup 

?7<t<l 



— Y, 1 {x J =-i} - Mo 

m j=0 



( = n 
5 



> 



n 



1/4 
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By (i), the l ast probability in the right-hand side is oex^in) and by Lemma |1 . 1 lj combined with 

,-i/6-2 £ Qr i ar g e 



Proposition 1.7 (ii), the quantity P [£ = n) = P^ [C( r ) = n ] is bounded below by n 
n. The desired result follows. 



□ 



Corollary 2.6. We have for every r\ G (0, 1) and 5 > 0: 



P„ 



sup 

»7<t<l 



K(((r)t) 



C(r)t 



> 



Proof. To simplify notation, set A n = < sup 

U<*<1 



n l/4 

C(r)t 



C(r) > n 



oe 1/2 (n). 



> 



i/i 



. It suffices to notice that: 



P, [K I CM > n] < I % {^j - ^ [A k | C (r) = k] 



oberving that the quantities P M [Afc|C(T) = k] are bounded by Lemma 2.5 (ii). Details are left to 
the reader. □ 

We have just shown that if a GW M tree has total progeny n, then it has approximatively fi^n 
leaves and the deviations from this value have exponentially small probability. Part (ii) of the 
following crucial lemma provides a converse to this statement by proving that if a GW M tree has 
n leaves, then the probability that its total progeny does not belong to [n/fiQ — n 3 / 4 , n/ ixq + n 3//4 ] 
decreases exponentially fast in n. 



Lemma 2.7. We have for 1 < j < n and 5 > 0: 



Kf) 



n 



> -TJl and (if) = 71 



(ii) P 



i-'-j 



X(f) = n and 



((f) 



n 



>C(f) 



3/4 



oei/2(n), uniformly in j . 
— oex/2(n), uniformly in j . 



Proof. The proof of assertion (i) is very similar to that of Lemma 2.5 The only difference is the 
fact that we are now considering a forest, but we can still use Proposition 1.4. We leave details 
to the reader. 

Let us turn to the proof of the second assertion, which is a bit more technical. First write: 



P ■ 



A(f) = n, 



C(f) 



n 



/'o 



>C(f) 



3/4 



P 



I-'-J 



n 



A(f) = n, C(f) > - + C(f) 3/4 



+ P 



H,3 



n 



A(f) = n, C(f) < - - C(f) 3/4 



Denote the first term on the right-hand side by I n and the second term by J n . We first deal with 



/„ and show that I n = oe\/2{n). We observe that: 



^ < E W) < M - M 3/4 , C(f) = k] . 



k=n 



Assertion (i) implies that P^j [A(f) < [i$k — jj,ok 3 ^ 4 , ((f) = k] = oe\/2(k), and this entails that 
In = oe 1/2 (n). 
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We complete the proof by showing that J n = oei/ 2 (n). Write: 



Jn < E IP/uJ 
k=n 



L A ( f ) ^ ^0 

c()= ^ 



By Lemma 2.2 (ii), we have (jf (/zo + ^k */ 4 ) > C2& for some C2 > and for every k > n, 
provided that n is sufficiently large. Then, using Proposition 1.6 and Lemma 2.2 (i): 

E 

k=n n p=l 



£ 1 {^ P =-i} > Mo + £1/4 



L™/wJ 

< £ 2exp{-c 2 k 1/2 ) 

k=n 



which is oe.\ii{n). 



□ 



3 Estimate for the probability of having n leaves 

In this section, we give a precise asymptotic estimate for the probability that a GW^ tree has n 
leaves. This result is of independent interest, but will also be useful when proving an invariance 
principle for GW M trees conditioned on having n leaves. 

Recall that // is a probability distribution on N satisfying hypothesis (Hg) with 9 e (1,2]. 



Recall also that h is the slowly varying function that was introduced just before Lemma 1.11 



Theorem 3.1. Let supp(fi) be the support of fi and let d > 1 be the largest integer such that 
supp(fi)\{0} is contained in b+dZ for some b G N. Then choose b minimal such that the preceding 
property holds. 

(i) There exists an integer N > such that the following holds. For every n > N, P M [A(r) = n + 
if, and only if, n is a multiple of gcd(6 — 1, d). 

(ii) We have: 

ro \\( \ , 11 V* fn ^cd{b-l,d) 

P M [A(r) = n + l] ^ Pl (0) h{n)nl/e+1 , 

whenn tends to infinity in the set of multiples o/gcd(6 — 1, d). Here we recall that pi is the 
continuous density of the law of X\, where (X t )t>o is the spectrally positive Levy process 
with Laplace exponent E[exp(— XX t )] = exp(tX e ). 

In particular, when the second moment of \x is finite : 

A*o gcd(6 - 1, d) 



P^ [A(r) = n + 1] 



n—>oo 



2/TO" 2 n 3 / 2 

when n tends to infinity in the set of multiples of gcd(6 — 1, d). 

Note that supp(/i)\{0} is non-lattice if and only if d — 1. It is crucial to keep in mind that 
even if fi is aperiodic, supp(/i)\{0} can be lattice (for example if the support of fi is {0,4, 7}). 



Remark 3.2. In the case where fi has finite variance, Theorem 3.1 is a consequence of results 
contained in [27]. 
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Before giving the proof of Theorem 3.1, let us mention a useful consequence. 
Corollary 3.3. Fix 5 > and i] E (0, 1). We have: 



P, 



sup 

n<t<i 



MC(r)f) 



C(r)t 



> 



5 



1/4 



A(r) 



n 



oei /2 (n), 



w/ien n — 1 tends to infinity in the set of multiples of gcd(6 — 1, d). 



This bound is an immediate consequence of Corollary 2.6 once we know that P M [A(r) 
decays like a power of n. 



n\ 



3.1 The Non-Lattice case 

We consider a random variable Y on N with distribution: 



1 - /i 



1 



i > 0. 



(6) 



We will first establish Theorem 3A_ when Y is non-lattice, that is b = and d = 1 in the notation 
of Theorem 13.11 



In agreement with the notation of Proposition 1.6, we consider the random walk W defined 
as W conditioned on having nonnegative jumps. In particular, W' n is the sum of n independent 
copies of the random variable Y, which is in the domain of attraction of a stable law of index 9. 
Indeed, when a 2 = oo, this follows from the characterization of the domain of attraction of stable 
laws (see |T7l Theorem 2.6.1]). When a 2 < oo, formula ^ shows that Y has a finite second 
moment as well. 



Consequently, if we write B' n for the quantity corresponding to a n in Theorem 1.10 when W 



is replaced by W', we have, by Theorem 1.10| (iii): 



lim sup 



B' n ¥[W' n = k] - pi 



k - nE[Y] 
BL 



0. 



(7) 



Moreover, there exists a slowly varying function h' : IR + — > IR + such that B' n = h'{n)n 1 / 6 and 
h'{n) — > oo as n — > oo when both o 2 = oo and 6 = 2. In the case where the second moment of fi 
is finite, we have B' n = where a' 2 is the variance of Y. Note also that K[Y] = /io/(l — /io)- 

The following lemma establishes an important link between h and h! . 

Lemma 3.4. If o 2 = oo we have lim B' n /B n = lim h'(n)/h(n) = (1 — /i ) -1 / 6 '. 



Proof. First assume that 9 < 2. Since P[Y > x] = j^-F[Wi > x] for x > 0, by Theorem 
(i), we have for n large enough: 



1.10 



B' =Y(l-6) l l e ini\x > 0; 



>x]<-)= T(l - 9) 1/e inf \x > 0; PfV^ > x] < 1 

n J [ n J 

Thus -B[n./(i— yu )j < B' n < S|- n /(i_^ Q )-|, and the conclusion easily follows. The proof in the case 
= 2 is similar and is left to the reader. □ 

We will use the following refinement of the local limit theorem (see [33, Chapter 7, P10] for 
a proof). 
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Theorem 3.5 (Strong Local Limit Theorem). Let Z = (Z n )„> be a random walk on Z with 
jump distribution p started from 0, where p is a non-lattice probability distribution on Z. Assume 
that the second moment of p is finite. Denote the mean of p by m and its variance by a 2 . Set 
a n = G\jnj2. Then: 



lim sup ( 1 V 



(x — mn) : 



n 



a n F[Z n 



x\ - pi 



x — mn 



0. 



Proof of Theorem 3.1 when Y is non-lattice. We first show that /i'(n)n 1 / e ' +1 P At [A(r) = n] con- 
verges to a positive real number. Fix e G (0, 1/2) and write: 



P,[A(r) 



n\ 



P, 



A(r) = n, (l-e)-<C(r)<(l + e)- 
Ho Ho 



A(r) 



n, 



HoC{r) 



n 



> e 



By Proposition 1.7 there exists C > such that h'{n) < Cn for every positive integer n. 
Moreever, for n large enough, for every x > 0, the property \xpo/ n — 6 implies \x — n/po\ > 
x 3//4 . Consequently: 



h\n)n l / e+1 ¥ h 



A(r) 



n, 



Ho({ T ) 



- 1 



7>. 



> e 



A(r) 



n. 



C(r)- 



/'o 



N3/4 



which is oei/ 2 (n) by Lemma 2.7 (ii). It is thus sufficient to show that: 



h'{n)n 1/9+ % 



A(r)=n, (l-e)-<C(r)<(l + e)- 



converges to a positive real number. 

In the following, S p will denote the sum of p independent Bernoulli variables of parameter fj, . 
Note that Si is non-lattice. The key idea is to write the quantity appearing in <^ as a sum, then 
rewrite it as an integral and finally use the dominated convergence theorem. For denote 
the smallest integer greater than or equal to x by \x\. To simplify notation, we write 0(1) for a 
bounded sequence indexed by n and o(l) for a sequence indexed by n which tends to 0. Using 
Proposition 1.6, we write: 



h'{n)n x ^ x ¥^ 



A(r) 



h'(n)n 



1/9+1 



-n, (l- e )^<C(r)<(l + e)- 
Ho Ho 

L(l+e)n/ M0 J 



E 



P=\{l-t)n/na\ 



P 



F[S P = n}F[W' = n - 1] 



ijn+0(l) , h'(n)n^ 0+1 

- - 

^n+0(l) L^//^0 + xJ 

^+o(i) ^ih'(n)n l l e+l 



[n/^o+x\—n 



71-1] 



v^D^ K^ P|S [*H =-] F ^o + ^j-n = "- 1 ] ( 9 ) 



Using the case # = 2 of Theorem 1.10 (iii), for fixed mGK, one sees that: 

1 



lim 

n— >oo 



Ho) 



-e 2 (!-to) 
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We now claim that there exists a bounded function F : R — > R + such that: 

Jim ti{n)n^nW{ n/ , 0+uVn] - n = n - 1] = 
for every fixed !i£i We distinguish two cases. When a 2 = oo, we have by (J7]): 



(10) 



lim h' '(n)rz 1 



n - 1] 

Mo 
1 - Mo 

__Mo_ 
1 - Mo 



lim pi 



Pi(0). 



n — 1 



_M_o_ 
1-Mo 



MO 



— n i 



In the case 6 = 2, we use the property that ft/(n) — > oo as n — > oo. When a 2 < oo, ([7| gives: 



lim a'Jn/2F\W\ , = n - 11 



Mo 



1 — MO 



lim pi 



n — 1 



MO 
1— Mo 



Mo 



(7 



1 - Mo 1 V o"' V 1 - Mo 



MO 
3/2 



Ma 



\/n — n/y/2 



u 



In both cases, we have obtained our claim (10). 
Next, for n > 1 and mGK, define : 



/n(«) 



71 



g n (u) = l {H< ^ } B' n F[W' l 



n 



-!]• 



The strong version of the Local Limit Theorem (Theorem 3.5) implies that there exists C > 
such that |/ n (w)| < C min (l, JjJ for all n > 1 and u G R. To bound g n , write: 

Proposition \1.7\ (ii) implies that there exists C > such that B' n / B'^ ^ +Uv ^j n — C' for every 



n sufficiently large and \u\ < j^V™, and then (jTJ) entails that there exists C > such that for all 
n > 1 and u e R we have |<7 n (w)| < C By the preceding bounds on / n and g n , we can apply the 
dominated convergence theorem to the right-hand side of ^ and we get: 



lim /i'(n)7i 1/9+1 P M [A(r) = n] = Mo 



+oo 



1 



efw — , 

-oo \/27r(l -Mo) 



2 



e 2 (!-M0) 



'ii: 



Finally, we need to identify the value of the integral in (11) and to express b! in terms of h. 
We again distinguish two cases. First suppose that a 2 < oo. An explicit computation of the 



right-hand side of (11) gives 



(f 
V2 



71 



3/2 P M [A( 



r = n 



- Mo^ 

>~ ]l Att /i /(l -Mo) + ^ ,2 (1 -Mo) 



17 



A simple calculation gives a' 2 = (a 2 — /U )/(l — /io) — (a*o/(1 — /^o)) 2 ; which entails: 



[A(r) 



^0 ^-3/2^ 



2ttct 2 



3.4 



we 



When a 2 = oo, we have F(u) = (/io/(l — A*o)) 8 Pi(0) so that (11) immediately gi ves t hat 
h'(n)n 1 ' 6+1 ¥ M [A(r) = n] converges towards (//o/(l — A i o)) 5 Pi(0) as n — >• oo. By Lemma 
conclude that: 

^[A(r)=n] ~ 4pi(0) 



1 



/i(n)nV»+i- 

Note that this formula is still valid for 9 = 2. This concludes the proof in the non-lattice case. □ 



3.2 The Lattice case 



We now sketch a proof of Theorem 3.1 when Y is lattice. 



Proof of Theorem 3A_ when Y is lattice. For (i), by Proposition 1.6, P M [A(r) = n + 1] > if and 
only if there exists > such that P[Wj£ = to] > 0. As a consequence, P M [A(r) = to + 1] > if 
and only if to can be written as a sum of elements of supp(F). Since supp(F) C b — 1 + gK, it 
follows that P^ [A(r) = to + 1] = if n is not divisible by gcd(6 — 1, d), and it is an easy number 



theoretical exercise to show that there exists N > such that for n > N, P M [A(r) 



n 



1] > 



if n is a multiple of gcd(6 — l,d). 

The asymptotic estimate of (ii) is obtained exactly as in the non-lattice case by making use 
of the Local Limit Theorem for lattice random variables (see e.g. [TTf Theorem 4.2.1]). We omit 
the argument to avoid technicalities. □ 

Remark 3.6. Let us briefly discuss the extension of the preceding results to the case where \x 
is periodic. In this case, Y is necessarily lattice. Indeed, the property supp(/i) C rfZ implies 
supp(F) C gK — 1. The same reasoning as above shows that Theorem 3.1 remains valid in this 



case. However, the span of supp(F) is not necessarily equal to the span of supp(/x). Consequently, 
P^ [A(t) = to] = can hold for infinitely many to (for example if the support of \i is {0, 28, 40, 52}) 
or for finitely many to (for instance if the support of /x is {0, 3, 6}). 



4 Conditioning on having at least n leaves 

In this section, we show that the scaling limit of a GW^ tree conditioned on having at least to 
leaves is the same (up to constants) as that of a GW M tree conditioned on having total progeny 
at least to. The argument goes as follows. By the large deviation result obtained in Section 2 
(which states that if a GW M tree has to leaves, then the probability that its total progeny does 
not belong to [n//j — to 3//4 ,to//!o + to 3//4 ] decreases exponentially fast in to), we establish that the 
probability measures P /i [ • | £(r) > to] and • | A(r) > Hqu — to 3//4 ] are close to each other for large 
to. The fact that the rescaled contour function of a GW M tree under P /1 [ ■ | £(r) > to] converges in 
distribution then allows us to conclude. 

Henceforth, if / is a closed subinterval of IR + , 6(7, R) stands for the space of all continuous 
functions from / to R + , which is equipped with the topology of uniform convergence on every 
compact subset of I. 
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Recall that /x is a probability distribution on N satisfying the hypothesis (Hg) for some 
6 G (1,2]. Recall also the definition of the sequence (B n ), introduced just before Lemma 
Also recall the notation CAt) for the contour function of a tree r introduced in Definition 



1.11 



1.3 



Theorem 4.1 (Duquesne). There exists a random continuous function on [0, 1] denoted by H e 
such that if i n is a tree distributed according to P M [- | £(r) = n}: 



B r , 



n 



C 2 nt(in); < t < 1 



n— >oo 



(^ exc ; < t < 1) 



where the convergence holds in the sense of weak convergence of the laws on C([0, 1], 
Proof. See [T2| Theorem 3.1] or |20| . 



□ 

Remark 4.2. The random function if exc , can be identified as the normalized excursion of the 
height process associated to the spectrally positive stable process X. The notion of the height 
process was introduced in j25] and studied in great detail in |13| ; see Section 5.1 for a definition. 



Using Theorem |4.1[ we shall prove that for every bounded nonnegative continuous function 
F on C([0, 1],M) the following convergence holds: 



E, 



C(r) 



C2C(r)t(r); < t < lj 



A(r) > n 



E[F{H exc )\. 



(12) 



Recall that P M j stands for the probability measure on TP which is the distribution of j 
independent GW^ trees. 

Lemma 4.3. Fix 1 < j < n. Let U be a bounded nonnegative measurable function on TP . Then: 

K,3 [U(f)k{f)>n]-^ [Uiflhm^n-n^] I < II^IL P M l n ~ Oo' + ^W* < ((f) < 7l]+oe 1/2 (n) 

where the estimate oe\/2{n) is uniform in j . 
Proof. First note that: 



E^ [C/(f)l C (f)> n 



E 
E 



H,3 



^(f)lc(f)>nl A(f) > Mo((f) _ C{f) 3/4 

t/ ( f ) 1 C(f)>" 1 A(f)> M oC(f)-C(f) 3/4 



+ [ f/ ( f ) 1 C(f)>n 1 A(f)< W) c(f)-C(f) 3/4 
+ oe 1/2 (n), (13) 



where we have used Lemma 2.7 (i) in the last equality. 
Secondly, write: 



E« [t/(f)l 



A(f)>Mon-n 3/4 



[^( f ) 1 A(f)> M0 n-n 3 ' /4l C(f)>™] + E MJ 1 A(f)> M n-n 3 / 4 



Let C n and be respectively the first and the second term appearing in th e right-hand side. 
To simplify notation, set a(n) = [IqU — n 3 / 4 for n > 1. Then, by Lemma 2.7 (ii), we have for n 
large enough: 



A, 



E 



[/(f) 1 



A(f)>a(n),C(f)<n, 



C(f)- 



MS 



< C(f) 3/4 



+ oei /a (n) 



< E 



^(^(^(nUCfKn.cW^-^f) 374 ] + oe V2(^) 
< II^IL F ^ [n - (Mo 1 + % 3/4 < C(f) < n] + oe 1/2 (' 
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We next consider C n . Choose n sufficiently large so that the function x H- a(x) is increasing 
over [pLon — n 3 / 4 , oo) and write: 



C n = E M [f/(f)l 
< E M [f/(f)l 



C(f)>n,A(f)>a(C(f)) 
C(f)>n,A(f)>a(C(f)) 



+ E w -[C/(f)l 
+ E lhj [U(t)l 



A(f)>a(n),C(f)>n,A(f)<a(C(f)) 



C(f)>n,A(f)<a(C(f)) 



E M,i [ f/ ( f ) 1 C(f)>^,A(f)>a(c(f))] +oe 1/2 (n) 



by Lemma 2.7 



By the preceding estimates we have, for n large: 



< E M -[C/(f)l 



A(f)> W n-ti 3 /* 



E 



[/(f)l f(f) > n l A(f) > wC(f) __ c(f) 3/4 



< Halloo P« [n ~ Oh 1 + i)^ 74 < C(f) < n] + oe 1/2 (n) 



and by combining this bound with (13) we get the desired estimate. 



□ 



Proposition 4.4. Let U, {U n ) n >i : T — > R + fre uniformly bounded measurable functions, meaning 
that there exists M > snc/i £/ia£ /or all n > 1 and r G T, U n (r) < M and U(t) < M. 

(%) [C/ (t) I £(r) = n] converges whenn tends to infinity, thenE^ [U(t)\ £(t) > n] converges 

to the same limit. 

(ii) IfE^ [{7 n (r)| C(t) > n] converges whenn tends to infinity, thenE^ [[/ n (r)| A(r) > |~/io^ — n 3 ^ 4 ] 
converges to the same limit. 



Proof. Using the formula: 

E„[tf(r)|C(r)>n] 



1 



k=n 



it is an easy exercise to verify that the first assertion is true. 

We turn to the proof of (ii). Fix < rj < 1/4. By Lemma 1.11 we may suppose that n is 
sufficiently large so that P M [C( r ) > n] > c 3 n~ 1 / e_,? for a constant c 3 > 0. Next, setting again 
a(n) = fx n — n 3//4 , we have: 



< 



|E„ [U n (r)\C(r) > n] - E M [U n (r)\X(r) > a(n)}\ 
Ef, [t/n(r)l c(r) > n ] E M [C/ n (r)l A (r) 
Pm [CM > n] P M [COO > n] 

„ 1/0+7? 

-P„ [ n - + l)n 3/4 < C(r) < 



+ 



E M [C/" n (r)l A ( r )> Q ( n ) 
P M [A(r) > a(n)} 



P„[A(r) > q(n)] 
P M [C(r) > n] 



< 



M- 



c 3 



.1/ 



P„[A(r) > a(n)] 



P M [C(r) > n] 



+ oei/ 2 (n) 



where we have used Lemma 43 in the last inequality. By Lemma |l.ll the first term of the 
right-hand side tends to 0. From Theorem 3.1 (ii), it is easy to get t hat P At [A(r) > n] ~ 

we obtain that 

□ 



1.11 



#/iy e p 1 (0)/(ri 1//6 '/i(r?,)) asn-> 00. By combining this estimate with Lemma 
P At [A(r) > a(n)]/P /t [C( r ) > n] tends to 1 as n — > 00. This completes the proof. 

Theorem 4.5. Forn > 1, lett n be a random tree distributed according to P M [- | A(r) > n]. Then: 



C 2C(tn)t (U < t < 1 



(rf) 

n— >oo 



(# t exc ; < t < 1), 



(14) 



where the convergence holds in the sense of weak convergence of the laws on C([0, 1], 
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Proof. Let F be a bounded nonnegative continuous function on C([0, 1],R). By Theorem 4.1 



E, 



/•' ( ^C 2C(T)t (r); < t < 1 



C(r) 



C(r) = n 



E[F(F exc )]. 



Proposition 4.4 (i) entails: 



B. 



C(r) 



C(r) 



C2C(r)t(r); < t < lj 



C(r) > n 



E[F(H exc )}. 



Proposition 4.4 (ii) then implies: 



E, 



C(r) 



C2f(r)t(r); < t < 1 



A(r)> ^on-n 3 / 4 l 



E[F(# exc )]. 



Since \fion — n 3 / 4 ] takes all positive integer values when n varies, the proof is complete. □ 

Remark 4.6. When the second moment of /x is finite, H exc = a/2© where <e denotes the normal- 
ized excursion of linear Brownian motion. Since the scaling constants B n = ayn/2 are known 
explicitly, in that case the theorem can be formulated as: 



a 



C 2C ( tn)4 (tn); < t < 1 



5 Conditioning on having exactly n leaves 

To avoid technical issues, we suppose that supp(/i)\{0} is non-lattice, so that [A(r) = n] > 

for n large enough. 



Recall that we have obtained an invariance principle for GW M trees under the probability 
distribution P M [- | A(r) > n]. Our goal is now to establish a similar result for trees under the 
probability distribution P M [ • | A(r) = n]. The key idea is to use an "absolute continuity" property. 
Let us briefly sketch the main step of the argument. 

Let k > 1. If t is a tree and if u(0),u(l), . . . are the vertices of r in lexicographical order, 
let Tk(r) be the first index j such that {u(0),u(l), . . . , u(j)} contains k leaves and Tfc(r) = oo if 
there is no such index. Fix a G (0, 1) and recall the notation W(r) for the Lukasiewicz path of a 
tree r. Then there exists a positive function D% on Z + such that, for every nonnegative function 
/ on the space of finite paths in Z: 

E„ [/ (W. A T LanjW (r)) | A(r) = n] = E„ [/ (W. ATLarij(r) (r)) ^(W^ j(T) (r)) | A(r) > n] . 

By combining the invariance principle for trees under P M [- | A(r) > n] together with estimates for 
Z}™(Wt l j(T)( r )) as n — > oo, we shall deduce an invariance principle for trees under P M [- | A(r) = 
n]. 
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5.1 The normalized excursion of the Levy process 

We follow the presentation of |12j . The underlying probability space will be denoted by (fl, iF, P). 
Let X be a process with paths in D(R + , M), the space of right-continuous with left limits (cadlag) 
real-valued functions, endowed with the Skorokhod Ji-topology. We refer the reader to [4, chap. 
3] and [321 chap. VI] for background concerning the Skorokhod topology. We denote the canonical 
filtration generated by X and augmented with the P- negligible sets by {3t)t>o- in agreement with 
the notation in the previous sections, we assume that X is a strictly stable spectrally positive 
Levy process with index 9 E (1,2] such that for A > 0: 

E[exp(-AX t )] = exp(tX 9 ). (15) 

See j3] for the proofs of the general assertions of this subsection concerning Levy processes. In 
particular, for 9 = 2 the process X is y2 times the standard Brownian motion on the line. 
Recall that X has the following scaling property: for c > 0, the process (c _1 / e X rf ,t > 0) has the 
same law as X. In particular, if we denote by p t the density of X t with respect to the Lebesgue 
measure, p t enjoys the following scaling property: 

p Xs (x) = X-Wp^xX- 1 '*) (16) 

for x E K and s, X > 0. The following notation will be useful: for s < t set 

If = infX, J t = infX. 

[«,*] [0,*] 

Notice that the process / is continuous since X has no negative jumps. 

The process X — I is a strong Markov process and is regular for itself with respect to X — I. 
We may and will choose —I as the local time of X — I at level 0. Let (gi,di),i E J be the 
excursion intervals of X — I above 0. For every % E J and s > 0, set u\ = X( 9 . +s ) A d- — X 9i . We 
view u* as an element of the excursion space £, which is defined by: 

£ = {u E D(M + ,M + ); w(0) = and Q{u) := sup{s > 0;w(s) > 0} E (0,oo)}. 

From Ito's excursion theory, the point measure 

is a Poisson measure with intensity dtN(du), where N(du) is a a-finite measure on £ which is 
called the ltd excursion measure. Without risk of confusion, we will also use the notation X for 
the canonical process on the space D(M + ,M). 

Let us define the normalized excursion of X. For every A > 0, define the re-scaling operator 
on the set of excursions by: 

S^\uj) = (X 1/9 uj(s/X), s>0). 

Note that N(£ > t) E (0, oo) for t > 0. The scaling property of X shows that the image of 
N(- | C > t) under S^ 1 /^ does not depend on t > 0. This common law, which is supported on the 
cadlag paths with unit lifetime, is called the law of the normalized excursion of X and denoted 
by N(-|£ = 1). We write X exc = (XJ xc ,0 < s < 1) for a process distributed according to 
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N( • |£ = 1). In particular, for 9 = 2 the process X eyiC is \/2 times the normalized excursion of 
linear Brownian motion. Informally, N( ■ \( = 1) is the law of an excursion under the Ito measure 
conditioned to have unit lifetime. 

We will also use the so-called continuous-time height process H associated with X which was 
introduced in j25]. If 9 = 2, H is set to be equal to X. If 9 G (1, 2), the process H is defined for 
every t > by: 

1 ft 

H t := lim- / 1/ x . ^+^ds 



t := am - / L{ Xs <i?+e}as, 

where the limit exists in P-probability and in N-measure on {t < £}. The definition of H thus 
makes sense under P or under N. The process H has a continuous modification both under P and 
under N (see [TBI Chapter 1] for details), and from now on we consider only this modification. 
Using simple scaling arguments one can also define H as a continuous random process under 



N( • |£ = 1). Let us finally mention that the limiting process H e * c in Theorem 4.1 has the 
distribution of H under N( • \( = 1). 

5.2 An invariance principle 

Recall that the Lukasiewicz path W(r) of a tree r G T is defined up to time C( r )- We extend it 
to Z + by setting Wj(r) = for i > ((t). Similarly, we extend the height function H(t) to Z + 
by setting Hi(r) = for i > ((r). We then extend H(r) to IR + by linear interpolation, 

H t (r) = (1 - {t})H lt} (r) + {t}H [t}+1 (r), t > 0, 

where {£} = t — [t\. 

Recall that fi is a probability distribution on N satisfying the hypothesis (Hg) for some 
9 G (1,2]. Recall also the notation h, B n introduced just before Lemma 1 . 1 1 1 For technical 



reasons, we put B u = B^ u j for u > 1. It is useful to keep in mind that B n = when the 

variance a 2 of fi is finite. We rely on the following theorem. 

Theorem 5.1 (Duquesne & Le Gall). Let t n be a random tree distributed according to P M [ ■ | C( r ) — 
n]. We have: 

(i-W LntJ (0,— *Utn)) ^ ( X uH t )o<t<x underN(-\(>l). 
Proof. See the concluding remark of [TBI Section 2.5]. □ 

5.3 Absolute continuity 

Recall from the beginning of this section the definition of Tfc(r) for a tree r. 

Proposition 5.2. Let n be a positive integer and let k be an integer such that 1 < k < n — 1. 
To simplify notation, set W^ k \r) = (Wo(t), . . . , ^T k (r){ T ))- For every bounded function f : 
Uj>iZ J — > JR. we have: 



K [/(^>M)I A(x) = „] = E „ [/(w<'V))|^|||| 



A(r) > n 



where ip p (j) = P^j [A(/) = p] and fpt{j) = P^j [A(/) > p] for every integer p > 1. 
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Proof. Let the random walk W be as in Proposition 1.4 The result follows from the latter 
proposition and an application of the strong Markov property to the random walk W at the first 
time it has made k negative jumps. See [241 Lemma 10] for details of the argument in a slightly 
different context. □ 



We will also use the following continuous version of Proposition 5.2 (see [20, Proposition 2.3] 
for a proof). 



Proposition 5.3. For s > and x > 0, set q s (x) = -p s (—x). For every a G (0, 1) and x > 

define: 

Ogx-ajx) 
J™ a ds q s (x)' 

Then for every measurable bounded function F : D([0, a], 1R 2 ) — > R + ; 



r a (x) 



N (F{(X t )o< t < a , (H t ) < t < a )\C = 1) = N (F{(X t ) < t < a , {H t ) < t < a )r a {X a )\( > 1) . 



We now control the Radon-Nikodym density appearing in Proposition 5.2 Recall that p s 
stands for the density of X s . It is well known that pi is bounded over IR and that the derivative 
of q u is bounded over R for every u > (see e.g. |34"1 I. 4]). 



Lemma 5.4. Fix a > 0. We have: 



(i) lim sup 



ds q s 



B 



0, (ii) lim sup 

n ~ > °° l<j<aB n 



n^n{j) - qi 



B 



n/fj.o , 



The proof of Lemma 5.4 is technical and is postponed to Section 5.5 



Corollary 5.5. Let r n be a sequence of positive integers such that n/r n — > /io as n — > oo. 



(i) We have lim sup 



l<j<aB n 



tPn-latJ, r n ] (J) 



dsq s [ — 

l-a \ B 



n /M0 , 



0. 



(ii) We have lim sup 



l<j<aB n 



n 



B 



"/MO , 



0. 



Proof. We shall only prove (i) . The second assertion is easier and is left to the reader. By Lemma 
i): 

roc 

V'n-LwnjCO - / dsq s 



5.4 



sup 

l<j<aB n 



J 



^(n-[aiM r n \)/^o , 



0. 



By (16) and the definition of q s (x): 

ds q. 



l-a 



B 



"/mo , 



To simplify notation, set ai(n, j) 



to verify that for n sufficiently large: 



(l-a)V«B nM) 



dSqs l(l-a)V°B n/ , 
and a2{n,j^ — i - 



B 



(n-la)i r n \)/ii 



-. It is thus sufficient 



sup 

l<j<a.B n 



ds (q s (ax(n,j)) - q s (a 2 (n, j))) 



(17) 
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From (16), we have for x > 0: 



dsq s (x) = x [ —p s (-x) = x [ J* , pi(-xs 1/e ) = 9 [ Px(-u) du, 
l Jl S Jl s 1+1 ' y Jo 



so that 



ds {q s {ai{n,j)) - q s (a 2 (n,j))) 



a>2(n,j) 
ai (n,j) 



Pi(—u) du 



<9M'\a 2 {n,j)-a x {n,j)\, 



where we have used the fact that pi is bounded by a positive real number M' . Thus we see that 



(17) will follow if we can verify that: 

sup |ai(n,j) - a 2 {n,j) 

l<j<aB n 

and to this end it is enough to establish that: 



T a) 1 / 9 B n i^ a ^(n-|_cwnJ)M) 



0. 



The last convergence is however immediate from our assumption on the sequence (r n ). 

5.4 Convergence of the scaled contour and height functions 

We now aim at proving invariance theorems under the conditional probability measure P M [ • | A(r) 
n). 

Recall the notation Tfc(r) introduced in the beginning of this section. For u > 0, set T u {t) ■ 
T lu\(r). 

Lemma 5.6. Fix a G (0, 1) and a < min(a/2, (1 — a)/2). 



□ 



(i) We have lim P u 

n— >oo ^ 



(ii) We have lim P p 



sup 

fe£(a— a,a+a) 



sup 



T / j, bn\ r ) 


-b 


> 


1 


n 


n l/4 




b 


> 


1 


n 




n l/4 



C(r) > 7i 
A(r) = n 



0. 



0. 



Proof. Both assertions are easy consequences of Corollaries 2.6 and 3.3 Details are left to the 
reader. □ 

Lemma 5.7. Let d be a positive integer. Fix a G (0, 1) and consider a sequence (Z n ) n >i of cadlag 
processes with values in IR d . Let also (K n ) n >i and (S n ) n >i be two sequences of positive random 
variables converging in probability towards 1. Assume that (Z n ) n >i converges in distribution in 
D([0, oo),lR d ) towards a cadlag process Z such that a.s. Z is continuous at a. Then (K n Zg nt ; < 
t < a) converges in distribution in D([0,a],R) towards (X t ; < t < a). 

Proof. By the Skorokhod Representation Theorem (see e.g. [4, Theorem 6.7]), we can assume 
that (X n ) n >i converges almost surely in D([0, oo),R d ) towards (X t ; t > 0) and that both (K n ) n >i 
and (S n ) n >i converge almost surely towards 1. The conclusion follows by standard properties of 
the Skorokhod topology (see e.g. |32| VI. Theorem 1.14]). □ 
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Lemma 5.8. For n > 1, let r n be the greatest positive integer such that \fior n 
a G (0, 1). Let t n be a random tree distributed according to P M [ ■ | A(t) = n]. Then the law of 



n 



. Fix 



1 B 

7 n n 



0<t<a 



converges to the law of (X t , Ht)o<t<a under N( • | ( = 1). 
Proof. We start by proving that for every a > 1: 

(j)AAn(l 



lim I sup 

n-s-oo \ 1 

iB„<j<aB„ 



= 0. 



(18) 



By Theorem 3.1, ip* n {l) / n^ n (l) 

that there exists 5 > such that for n sufficiently large: 



n —7- oo. Using Corollary 5.5, it then suffices to verify 



inf f dsq s ( ) > 5. 

±B n <j<aB n Jl-a \ B n J 

This follows from the fact that there exists 5' > such that J 1 °^ G?sg s (x) > 5' for every xG [1/a, a] 
Details are left to the reader. 



Fix a bounded continuous function F : D([0, a] 



— > 



To simplify notation, for every 



tree r with A(r) > n, set W^(r) = (^ (n) (r)) < t <a and #M( T ) = (# t (n) (r))o<t<a, where for 
< t < a: 



W, 



(»), 



' n 



-^H T ( T )t(r). 

laiJ. r n (T) a \ ) 

I n 



Then set GW(r) = F (^(r), H^(t)). Note that by @, fl"C n )(r) is a measurable function of 
W^(t). Fix a > 1 and put: 



By combining Proposition 5.2 and the estimate (18), we get: 

"W T m(t)' 



lim 

n— too 



e[gw (gi^J -e„ 



G (n) (r) l^ (T) r a 



5 



"•/mo 



A(r) > n 



0. 



(19) 



We now claim that the law of (lU (n) (r), H (n \r)) under P M [ • | A(r) > n] converges towards the 
law of (X t , H t )o<t< a under N(- | ( > 1). To establish this convergence, by Proposition 4.4 (ii), it 
is sufficient to show that the law of 

1 B 

— Wi T , , 1 1 (r), — H T W ((t) 



0<t<a 



under P^[- | C(r) > n] converges towards the law of (X t , H t ) < t < a under N(- | Q > 1). Indeed, 



Proposition 4.4 (ii) will then imply that the same convergence holds if we replace P [ ■ \(( t) > n] 
by P[-|A(t) > \non — n 3//4 ] ] and we just have to replace n by r n . By Lemma 



5.6 



^V['IC( T ) ^ n \) T aiM}n (r) I (an) converges in probability towards 1, and by Theorem 5.1, the 



under 
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law of (^W Lnf j(r),^iy ni (r)) t>o converges to the law of (X t ,H t ) t > under N( • \ ( > 1). Our 
claim now follows from Lemma T5. 71 

From the definition of r n , we have r n /n — > 1 /fiQ as n — > oo, which entails B rn /B n / I1Q — > 1. 
Thanks to (19) and the preceding claim, we get that: 

n limE[G(«)(gi A « (tn) ] = N(F((X t ,H t ) < t < a )T a (X a )l {ka<Xa<a} \(>l) 

= N(F((X t)J ff t )o<i<a)l { i<x a < Q} IC = l), (20) 
where we have used Proposition |5.3 in the last equality. By taking F = 1, we obtain: 

lim lim PU"(U] = 1. 



By choosing a > sufficiently large, we easily deduce from the convergence (20) that: 

lim E [G^(Q] = N(F((X t ,H t ) <t<a) |C = !)• 
This completes the proof. □ 



Recall that C(r) stands for the contour function of the tree r, introduced in Definition 1.3 



Theorem 5.9. For every n > 1 snc/i t/ia£ P A1 [A(r) = n] > ; let t n be a random tree distributed 
according to P M [- 1 A(r) = n]. Then the following convergences hold. 

(i) Fix a G (0, 1). We have: 

' ' W LC(in)tJ (O;0<*<oJ ^% {X t ; 0<t<a) under N(-\( = 1). (21) 



n—>oo 



(ii) We have: 

(^Hc^iQ, ^M Hc{tn)t (t n )) (H t , H t ) 0<t<1 under N( • | C = 1). (22) 

Remark 5.10. It is possible to replace the scaling factors 1/B^ n ) and B^^ tn )/((t n ) by respectively 
IIq 6 ' I ' B n and n)^ 1 ^ B n /n without changing the statement of the theorem. This follows indeed 
from the fact that ((t n )/n converges in distribution towards l//io under P M [- | A(r) = n]. 

The convergence of rescaled contour functions in (ii) implies that the tree t n , viewed as a 
finite metric space for the graph distance and suitably rescaled, converges to the ^-stable tree 
in distribution for the Gromov-Hausdorff distance on isometry classes of compact metric spaces 
(see e.g. |23l Section 2] for details). 

The convergence (21) actually holds with a = 1. This will be proved later in Section 6. 

Proof. Recall that throughout this section we limit ourselves to the case where P At [A(r) = n] > 
for all n sufficiently large. 

We start with (i). As in Lemma 5.8, let r n be the greatest positive integer such that |~/io?" n — 
r n 3 ^ 4 ] = n and write: 

1— W LC(fn)*j(t») = K n ■ W^.^^j (Q, 

S \vn J ' n 
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where K n 



B r n / B t(in) and Sn 



aC(t n )/T Wn (t n ). Recall that r n /n -» l//i . By Corollary [3~3 



C(tn)/n> converges in probability to 1/fMo- On the one hand, this entails that K n converges in 
probability towards 1, and on the other hand, together with Lemma 5.6 (ii), this entails that S n 



converges in probability towards 1. The convergence (21) then follows from Lemmas 5.8 and 5.7 



For the second assertion, we start by observing that: 



^C(tn) 

C(tn) 



#C(i»)t(0; o < t < a 



(J) 



(H t ; 0<t<a) under N( • | C = 1). 



(23) 



This convergence follows from Lemmas 5.8 and 5.7 by the same arguments we used to establish 
(i). To complete the proof we use known relations between the height process and the contour 
process (see e.g. [T2l Remark 3.2]) to show that an analog of (23) also holds for the contour 
process. For < p < ((in) set b p = 2p — H p (i n ) so that b p represents the time needed by the 
contour process to reach the (p + l)-th individual of ((in)- Also set &^(t n ) = 2(£(t n ) — 1). Note 
that Cb p = H p for every p G {0, 1, ... , ((t n )}. From this observation and the definitions of the 
contour function and the height function of a tree, we easily get: 



sup |C t (t„ 

te[b p ,b p+ i] 



H p (t n )\ < \H p+1 (t n )-H p (t n )\ + l. 



(24) 



for < p < ({tn)- Then define the random function g n '■ [0,2£(t n )] — > N by setting g n {t) = k if 
t G \b k ,b k +i) and k < C(t„), and g n (t) = C(t n ) if t G [2(C(t„) - l),2C(t n )]. If t < 2(((t n ) - 1), 
g n {t) is the largest rank of an individual that has been visited before time t by the contour 
function, if the individuals are listed 0, 1, . . . , ((in) — 1 in lexicographical order. Finally, set 
9n(t) = 9n(((in)t)/((i n ). Fix a G (0, 1). Then, by fy: 



sup 

t< b lc<(in)i 

- C(t„) 



C(U) 



C(tr, 



Cc(t„)t(tn) 



C(U) 



H, 



C(t»)9n(*) 



< 



B, 



C(tn) 



+ 



B 



C(tn) 



((Q C(t 



sup \H k+1 (i n ) - H k (t n ) 



n) fc<L«C(t„)J 



which converges in probability to by (23) and the path continuity of H. On the other hand, it 
follows from the definition of g n that 



sup 

< b LcC(tn)J 
- C(tn) 



9n(t) 



< 



< 



1 



C(tn) 
1 



2B, 



sup 

v fc<K(tn)J 



k 



h 
2 



+ 1 



B, 



sup 

C(t„) fc<aC(tn) 



C(in 



-H k (t n ) + 



by (23). Finally, by the definition of b n and using (23) we see that 



VC(tn)] 
C(tn) 



converges in probability 



towards 2a. By applying the preceding observations with a replaced by a' G (a, 1), we conclude 
that: 

— — sup |C 2C ( tn )t(t n J 

C(tnJ 0<t<a 



H(( in ) t (in) 



0. 



(25) 



Together with (23), this implies 

( B 



3 C(tn) 
V C(in) 



^2C(t„)t(tn); < t < aj A (H t ; < t < a) under N( • | ( = 1). 



(26) 
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We now use a time-reversal argument in order to show that the convergence holds on the 
whole segment [0, 1]. To this end, we adapt |T2l Remark 3.2] and [TBI Section 2.4] to our context. 
See also [20J, where we used the same argument to give another proof of Duquesne's Theorem 
II) Observe that (C t (i n ); 0<t< 2(C(t„) - 1)) and (C 2(f(tn) _ 1) _ t (t n ); < t < 2(C(t„) - 1)) have 
the same distribution. From this convergence and the convergence ( |26| , it is an easy exercise to 
obtain that: 



iw) C2(M °- t - 1 ) 



(H t ; < t < 1) under N( • | C = 1). 



(27) 



See the last paragraph of the proof of Theorem 6.1 in [22j for additional details in a similar 
argument. 



Finally, we verify that (22) can be derived from (27). To this end, we show that the conver- 



gence (25) also holds for a = 1. First note that: 



sup 

0<t<2 



9n(t) ~ - 



< -i- I 1 sup H k (Q + 1 



((in) \2 fc<C(t„) 



2B t 



sup 

C(M fc<2C(t„) 



%) r u \ _i_ 1 



■0 (28) 



by (27). Secondly, by (24): 



sup 

0<t<2 



B 



C(tn) 



C(t« 



C 'c(t n )t(tr, 



B, 



C(tn) 



C(t* 



H, 



C(U)fln(t)V^ 



(t. 



< 



+ 



((in) ((in) fc<C(W) 
n) fe<C(tn) 



sup \H k+1 (t n ) - H k (t n ) 



C(tn) Cft 



By (27), in order to show that the latter quantity tends to in probability, it is sufficient to 



verify that sup^^) ((i n ) 1 \b k +\ — b k \ converges to in probability. But by the definition of b p : 



sup 

fe<C(tn) 



bk+i b k 



C(tn) C(U 



sup 

fc<C(tn) 



2 + H k (t n ) — iffc+i^ 



C(tn) 



, 2 -fffc(t r , 
< — r + 2 SUp 



C(tn) 



fc<C(tn) U*n 



which converges in probability to by the same argument as in (28). We have thus obtained 
that 



B 



C(t») 



C(tn) S <Sci \ C2 ^^ tn) " H a^)~9n{2t)^n) 



0. 



Combining this with (27), we conclude that: 



C(t 



-T-C2C(i„)*(tn), -^y^C(tn)?n(2t)( t ") 



0<t<l 



n—^oo 



(H t ,Ht)o<t<i under N( ■ | C = 1) 



The convergence (28) then entails: 



This completes the proof. 



n— >oo 



(^,^) < t <i under N(-|C = 1). 



□ 
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5.5 Proof of the technical lemma 



In this section, we control the Radon- Nikodym densities appearing in Proposotion|5.2[ We heavily 



rely on the strong version of the Local Limit Theorem (Theorem 3.5) 



Throughout this section, (W n ) n >o will stand for the same random walk as in Proposition 1.4 



Recall also the notation q s introduced in Proposition 5.3 



5.5.1 Proof of Lemma 5.4 (i) 



We will use two lemmas to prove Lemma 5.4 (i): the first one gives an estimate for P Mj j [((f) > n] 
and the second one shows that P M j [£(f) > n] is close to P^j [A(f) > Hon — n * 



Lemma 5.11. We have lim sup 



/oo 
dsq s 



B, 



0. 



Proof. Le Gall established this result in the case where the variance of /i is finite in [22j. See 
Lemma 3.2 (ii)] for the proof in the general case, which is a generalization of Le Gall's proof. □ 

Lemma 5.12. Fix a > 0. We have lim sup |P„.-[CC/5 > n] - ^uj[Kf) > ^ n ~ ™ 3/4 ]l = °- 



Proof. To simplify notation, set 7 = /x 1 + 1. By Lemma 4.3, it is sufficient to show that: 



lim sup ¥^[n - jn 3/4 < ((f) < n] = 0. 



l<j<aB n 



From the local limit theorem (Theorem 1.10), we have, for every j G Z: 

J 



B k ¥[W k = j] - Pl 



< e(k), 



where e(k) — > 0. The function x h-> |xpi(— x)| is bounded over 1 by a real number which we 
will denote by M (see e.g. [31 I. 4]). Set M n (j) = P MJ -[n - 7n 3 / 4 < £(f) < n] and 8{n) = 
\n — 7n 3//4 J + 1. Fix e > and suppose that n is sufficiently large so that n — 7?i 3//4 < k < 
implies \e{k)\ < e and B k > B n /2. Then, for l<j< aB n , by Q: 



7? 



M n (j) 



E JWC(*) = *]= E £w = -j] 

A;=5(n) fc=<5(n) 



" M + 2ae 

fc=5(n) ^ 



which tends to as n — > 00. 



Proof of Lemma 5.4 (%)■ By Lemmas 5.11 and 5.12 



lim sup 



P^-[A(f) > /i n - n 3/4 ] - rfs g s (JL 



0. 



□ 



(29) 



30 



Let r n be the greatest positive integer such that [yU r n — r„ 3//4 ] = n. We apply (29) with n 
replaced by r n , and we see that the desired result will follow if we can prove that 



lim 



sup 

K l<j<aB„ 



oo 

ds q a , 
i V B 



J 



dsq s ( — — - 



0. 



The proof of the latter convergence is similar to that of (|17|) noting that: 

B n B n 



lim 

n— >oo 



B r 



B, 



n/fi 



0. 



This completes the proof of Lemma 5.4 (i 



□ 



5.5.2 Proof of Lemma 5.4 (ii) 



The proof of Lemma 5.4 (ii) is very technical, so we will sometimes only sketch arguments. 



As previously, denote by S n the sum of n independent Bernoulli random variables of parameter 
/io, and by W the random walk W conditioned on having nonnegative jumps. More precisely, 
W>[W{ = i] = fi(i + 1)/(1 - fi Q ) for i > 0. Recall that E[W{] = fi /(l - fi Q ) and that a' 2 is the 
variance of W[. 



Fix < e < 1. By Lemma 2.7 



ni>n{j) = 



i-'-j 



A(r) 



n. 



n 



/'o 



n 



en < C(f) < — + en 
Ho 



+ oei/ 2 (n) 



(30) 



where the estimate oe\i-i{n) is uniform in j. It is thus sufficient to control the first term in the 
last expression. For \u\ < ey/n and 1 < j < n set: 



r n {u> 



[n/fio + 



uvn 



a,. 



n (u) = VnP[S rn{u) = n], b n (u,j) = B'J?[W' rn[u) _ n = n- j], 



and using Proposition |1.6 | write: 

n 



nF 



A(r) 



n 



n, 



en < C(f) < — + en 
Ho 

^ J -nS P = n]F[W;_ n = n-j] 



n 



p=\n//j,o—en\ P 

n/na+en+0(i) j 



In/pLo— en+0(l) 

E^/n+o(l) n 

du — —a n (u)b n (u,j). 



n-j\ 



J 



B' n J- e% /E+o(i) r n (u 
Let us introduce the following notation. Set c = fio/(l — /io) and for u, x G 



(31) 



F(u) = e 2 mo(i-mo)' . 

^27T/i (l - /i ) 



G (u,x) = c 1/ V -c 1/e x 



a' 



{cr 2 <oo} 



3.4 



for n 



Put Fq(u) = y/jM}F(fi^ 2 u). Fix a > 0. Set finally a' = a(l + (1 — /^o) 1 ^)- By Lemma 
sufficiently large, we have aB n < a'B' n . We suppose in the following that n is sufficiently large 
so that the latter inequality holds. 
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Lemma 5.13. For fixed «6M, we have: 



a n {u) 



F (u), sup \b n (u, j) - G (u,j/B' n 

l<j<aB n 



0. 



Proof. The first convergence is an immediate consequence of Theorem 1.10 (iii) after noting that 
E [Si] = fio and that the variance of Si is //o(l — A*o)- The second convergence is more technical. 
To simplify notation, set: 



q n (u) = r n (u) - n, Q n (u,j) = c 1,e pi 



n-j - cg n (u) 

B 'qn(u) 



Note that q n (u) = n/c + Uy/n + 0(1). In particular, B' n ~ c l,e B' as n — > oo. Consequently, 
by ([7]), |&n(w, j) — Q n (u,j)\ — > as n — > 00, uniformly in < j < aB n . It thus remains to show 



that 



To this end, introduce: 



sup \Q n (u,j) - G (u,j/B' n 

l<j<aB n 



n-j - cq n (u) 1/g j V2c 3/2 u 

B> + W + o< {a2< ™ } 



(32) 



Recall that the absolute value of the derivative of pi is bounded by a constant which will be 
denoted by M', giving \Q n (u,j) — G (u,j/B' n )\ < M'K n {u, j). It is thus sufficient to show that 
K n (u,j) — > as n — > 00, uniformly in < j < aB n . 

We first treat the case where a 2 < 00, so that 6 = 2. In this case, B' n = where a' 2 is 

the variance of W[. Simple calculations show that K n (u,j) < A/^fn for some A > 1 depending 
only on u, so that K n (u,j) — > as n — > 00, uniformly in < j < aB n . 

Let us now suppose that a 2 = 00. Fir st as sume that 9 < 2. Choose 77 > such that 

(i), for n sufficiently large, B' q , u ^ > n l l e ~' n . 



1/6 - 7] - 1/2 > 0. By Proposition 



1.7 



Moreover, we can write B' n / (c 1 ^ B' q , a) = 1 + e n (u) where, for fixed u, e n (u) — > as n — > 00. 
Putting these estimates together, we obtain that for large n and for 1 < j < aB n : 



,1/ 



c W en{u) + -, 

n n n q„(u) 



< a'c^e n {u) +™+o(4m 



which tends to as n — > 00. 

We finally treat the case when a 2 = 00 and 6 = 2. Recall the definition of the slowly varying 
function h! introduced in Section 3.1 and let e n (u) be as previously. By the remark following the 
proof of Theorem 1.10 h! is increasing so that for n large enough: 



K n (u,j) 



h'{q n {u))^q n {u) 



<a'c^e n (u) + A- 



11. 



h'(n/(2c)) 



+ 



1/1 



11 



for some A > 0. The latter quantity tends to as n — > 00 since h'{n) — > 00 as n — > 00 by the 
remark following the proof of Theorem |1.10[ This completes the proof. □ 
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Proof of Lemma 5.4 (ii)- From Theorem 3.5 we have the bound a n (u) < C(l A u 2 ) and by rf7J, 
the functions b n are uniformly bounded. Since, for j < aB n , 



B' n 



du — — a n (u)b n (u,j) - fi I dua n (u)b n (u,j] 

Ey/n 



r n {u 

it follows from the dominated convergence theorem that: 

- I In I 

B; 



i /" ev/ ™ 
< Ca / du{ 



lAu 



sup 

l<j<aB„ 



J_ 

BL 



eVn n j 

du — —- a n (u)b n (u,j) - — fj, 



—ey/n 



dua n (u)b n (u,j] 



n 



r n {u) 



0. 



Mo 



(33) 



Recall that qi(x) = xpx(-x). By (30), (31) and (33), to prove Lemma 5.4 (ii), it is sufficient to 
establish that: 



sup 

l<j<aB„ 



fi dua n (u)b n (u,j)-c 1/e p 1 

J -Ey/n 



J 







Let us first show that: 

ey/n 

sup 



l<j<aB n 



Mo 



Ey/n 



+oo 



dua n (u)b n (u,j) - fiQ / duF (u)G (u,j/B' n 



0. 



(34) 



(35) 



To this end, let us prove the following stronger convergence: 

eV^ ( \ 

du \ sup \a n (u)b n (u, j) - F (u)G (u,j/B n )\) — y 0. (36) 

Ey/E \l<j<aB n j 

It is clear that the function Go is uniformly bounded. Recall that the functions b n are uniformly 
bounded as well. Moreover, F is an integrable function and we have the bound a n (u) < C(l A 
u~ 2 ). The convergence (36) then follows from Lemma 5.13 and the dominated convergence 
theorem. This proves (35). 

To conclude, we distinguish two cases. First assume that cr 2 < oo, so that 6 = 2. Then 
W[ has finite variance cr' 2 as well. Recall that a' 2 = (cr 2 — /x )/(l — Mo) — (Mo/(l — Mo)) 2 an d 
B' n = a'y/n/2. A straightforward calculation based on the fact that, for a, /3, 7, 5 > 0, 



+00 



due- au2 e~^ +Su)2 



71 



g a/P + 6 2 



gives: 



Mo 



/+00 r+00 I 

du F (u)G (u, j j B' n ) = / du ii F (u)c /2 pi [c 
-00 j—00 \ 



1/2 J 
a'y/n/2 



V2c 3 / 2 u 



a 



c 1/2 P i 



a\Jn/(2^ ) 



c 1/2 Pl 



B 



By combining this with (35), we get (34) as desired. 

Now assume that a 2 = 00. In this case: 

r+00 



Mo 



/+00 r+00 I 

duF (u)G (uJ/B' n ) = / du iinFn{u)c l/ p x I -c 
-00 J—00 \ 



,1/6 J_ 
BL 



By Lemma 3.4, B' n /B n /^ Q —y c 1 ^, which implies: 



sup 

l<j<aB„ 



Pi 



,1/0 J_ 

' Bl 



Pi 



c 1/e pi 



0. 



BL 



By combining (37) and (38) with (35), we get (34) as desired. 



(37) 



(38) 



□ 
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6 Convergence of rescaled Lukasiewicz paths when condi- 
tioning on having exactly n leaves 

We have previously established that the rescaled Lukasiewicz path, height function and contour 
process of a tree distributed according to P M [- | A(r) = n] converge in distribution on [0, a] for 
every a G (0, 1). Recall that by means of a time-reversal argument, we were able to extend the 
convergence of the scaled height and contour functions to the whole segment [0, 1]. However, since 
the Lukasiewicz path W(r) of a tree distributed according to P M [- | A(r) = n] is not invariant 
under time-reversal, another approach is needed to extend the convergence of W(r) (properly 
rescaled) to the whole segment [0, 1]. To this end, we will use a Vervaat transformation. Let us 
stress that the Lukasiewicz path of a tree distributed according to P M [ • | A(r) = n] does not have 
a deterministic length, so that special care is necessary to prove the following theorem. 

Recall that /i is a probability distribution on N satisfying the hypothesis (He) for some 
9 G (1,2]. Recall also the definition of the sequence (B n ), introduced just before Lemma [l . 1 1 [ 



Theorem 6.1. For every n > 1 such that F u [X(t) 
according to F u [- | A(r) = n]. Then: 



n] > 0, let i n be a random tree distributed 



B 



C(t») 



-w LC(t „ )4J (U o<t< i 



(d) 
n—>oo 



(X t ; < t < 1) under N( • | £ = 1). 



(39) 



As previously, to avoid further technicalities, we prove Theorem 6.1 in the case where P^[A(r) = 
n] > for all n sufficiently large. Throughout this section, (W n ) n >o will stand for the random 



walk of Proposition 1.4 Introduce the following notation for n > and u > 0: 



ra-l 



A(n) = £ l {Wi+1 - Wj =-i}, T u = M{k > 0; A(k) = [u\}, ( = ini{k > 0; W k = -1}. (40) 

3=0 

For technical reasons, we put B u = B\ u \ for u > 1. 
Lemma 6.2. The following properties hold. 



(i) We have P 



T 



n jj 
(ii) For every a > 0, 



> 



n 



1/4 



oe 1/2 (n) 



1 



W\t T \;0<t<a 



n— >co 



(X t ; 0<t<a) under P. 



Proof. The first assertion is an easy consequence of Lemma 2.5 (i). For (ii), we use a general- 
ization of Donsker's invariance theorem to the stable case, which states that \ W\_ n t\/B n ; t > 
converges in distribution towards (X t ; t > 0) as n — > oo. See e.g. [32, Chapter VII]. By (i), T n /n 
converges almost surely towards 1///q> an d (ii) easily follows. □ 



6.1 The Vervaat transformation 

We introduce the Vervaat transformation, which will allow us to deal with random paths with 
no positivity constraint. Recall the notation introduced in Section 1.3. 
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Definition 6.3. Let k > 1 be an integer and let x = (x 1: . . . , x k ) G Z k . Set Wj — X\ H + Xj 

for 1 < j < k and let the integer i*(x) be defined by i*(x) = inf{j > l;tUj = mini<j<fc Wi}. The 
Vervaat transform of x is defined as V(x) = 

Also introduce the following notation for positive integers k > n: 

k 

§ k (n) = {(xi, . . . , x k ) E {—1, 0, 1, . . .}; ^Xi = —1 and Card{l < % < k; Xi = — 1} = n}, 

i=i 

as well as: 



S k ( n ) = {( x i, ■ ■ ■ , x k) e S fe H; > -1 for ever y me{i,2,...,k- l}}. 

1=1 



Finally set S(n) = Ufc> n S (n). 

Lemma 6.4. Let k > n be positive integers. Set Z k = (Wi, W 2 - Wi, . . . , W k - W k -i). 

(i) Conditionally on the event {W k = —1}, the random variable i*(Z k ) is uniformly distributed 
on {1,2, ... ,k} and is independent of V(Z k ). 



k 

(ii) LetxeS (n). Then: 



P [V{Z k ) = x, Z k = -l] = j¥ [V{Z k ) = x] . (41) 

K 



Proof. The first assertion is a well-known fact, but we give a proof for the sake of completeness. 
k 

Let x G S (n) with k > n. Then: 



P [Z k = x] = P \( zk ) = x l = r p \V{Z k ) = x] . (42) 
^ 1=1 ''' 



For the first equality, we have used the fact that Z k and (yZ k ^ have the same law. The second 
equality follows from the fact that by the Cyclic Lemma, there exists a unique 1 < i* < k such 
that (Z k f* ] e U n >!S fe (n), which entails V(Z k ) = (Z k f* ] . Then, for 1 < % < k: 

P [i*{Z k ) = i,V(Z k ) = x] = P [(Z k ) {i) = x] = P [Z k = x] = ip [V(Z fe ) = x] . 

The conclusion follows. 

For (ii), write x = (x±, . . . , x k ) and observe that 

P [V(Z k ) =x,Z k = -l]=F [V(Z k ) = x, x k _ lt(zk) = -1] . 

The conclusion follows from (i) since Card{l < i < k; x^ = —1} = n. □ 

Proposition 6.5. For every integer n > 1, the law of the vector V(W\, W2 — W\, . . . , Wt u — 

Wt„-i) under P [ • | Wt„ = — 1] coincides with the law of the vector (Wi(r), W2(r) — Wi(r), . . . , 
W ( ( T) (r)-'W ((T) _ 1 (r)) ,, «nderP M [.|A(r)=n]. 
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Proof. To simplify notation set Z = (W\, W2 — W\, . . . , Wt„ — Wt„-i)- Fix an integer k > n, 
and set Z k = (W 1 ,W 2 - W u . . . ,W k - W fc _i). Let x = (x 1} ...,x k )e § k (n). We have: 

P [\{Z) = x I W Tn = -1] = P \V{Z k ) = *,T n = k\W Tn = -1] 

simply because Z = Z k on the event {T n = k}. Then write: 

P \V(Z k ) = x,T n = k] P \V(Z k ) = x, Z£ = -1 



F[V(Z)=x\W Tn = -l] 



F[W Tn 

n p[y(z k ) 



-1] 



k F[W Tn = -l] 
nF [Z G S(n 



n- 



F[W T% 
P [Z k = x] 



P [W Tn = -1) 



F[W Tn = -l 
P [Z = x I Z G S(n 



n 



-1] 



P[W Tm = -l] 



where we have used (|41h for the third equality and (42) for the fourth equality. Summing over all 

—k r — i 

possible x G S (n) and then over k > n, we get P [Wt„ = — 1] = nP [Z G S(n)J . As a consequence, 
we have P [V{Z) = x | W Tn = -1] = P \Z = x | Z G S(n)l for every x G S(n). 



On the other hand, by Proposition 1.4 for every x G S(n 



P M [(W x (r), . . . , W c(r) (r) - W c(r) _ 1 (r)) = x I A(r) = n] = P = x | A(C) = n] 

where we have used the notation ( introduced in (40). The probability appearing in the right- 
hand side is equal to P \Z — x | Z G S(n)l because {A(£) = n} = {Z G S(n)}, and moreover we 
have ( = T n and Z^ = Z on this event. We conclude that: 

P, [(W^r), . . . , W c(T) (r) - W c(rM (r)) = x | A(r) = n] 



P[Z = x| Z G S(n) 
P[V(Z)=x|Wr n = -l]. 



This completes the proof. 



□ 



Definition 6.6. Set D ([0,1],R) = {to G D([0, 1],R); w(0) = 0}. The Vervaat transformation 
in continuous time, denoted by V : O ([0, 1],R) — > D([0, 1],R), is defined as follows. For u> G 
D ([0, 1],R), set gx(u) = inf{t G [0,l];w(t-) Aw(t) = inf [0 ,i]w}. Then define: 



V(«)(f) 



u(gi(u) +t) - inf [o,i] w, 

+ t - 1) + w(l) - inf[ ,i] w 



if PiM+t < 1, 
if 5-1(0;) +t > 1. 



b c(t) ' ' LCMtJ 0); < t < 1 J under P m 



Corollary 6.7. The law of W 

toe /aw 0/ V (^W^fj ; < t < l) under P [ • | W Tn = -1] . 



nl coincides with 



This immediately follows from Proposition 6.5 In the next subsections, we first get a limiting 
result under P[ - 1 Wr n = — 1] and then apply the Vervaat transformation using the preceding 
remark. The advantage of dealing with P [ • | Wt„ = — 1] is to avoid any positivity constraint. 
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6.2 Time Reversal 



The probability measure P [ • | Wr n = — 1] enjoys a time-reversal invariance property that will be 
useful in our applications. Ultimately, as for the height and contour processes, this time-reversal 
property will allow us to get the convergence of rescaled Lukasiewicz paths over the whole segment 
[0,1]. 

Proposition 6.8. Fix two integers m > n > 1 such that P [W m = 0, A(m) = n] > 0. For < i < 

m, set Wi = W m - W m -i. The law of the vector (W , . . . , W m ) under P [ • | W m = 0, A(m) = n] 



coincides with the law of the vector (W , . . . , under the same probability 



measure. 



Proof. This is left as an exercise. □ 
6.3 The Levy Bridge 

The Levy bridge X hr can be seen informally as the path (X t \ < t < 1) conditioned to be at 
level zero at time one. See (3j Chapter VIII] for definitions. 

Proposition 6.9. The following two properties hold. 

(i) The continuous Vervaat transformation V is almost surely continuous at X hr and V(X hr ) 
has the same distribution as X under N( ■ | ( = 1). 

(ii) Fix a G (0, 1). Let F be a bounded continuous functional on O([0, a],R). We have: 

Px- a (-X a 



E [F (X 4 br ; < t < a)] = E 



F(X t ;0<t<a) 

Pi(0) 



Proof. The continuity of V at X hr follows from the fact that the absolute minimum of X hr is 
almost surely attained at a unique time. See Theorem 4] for a proof of the fact that V(X hr ) has 
the same distribution as X under N( ■ |£ = 1). For (ii), see [31 Formula (8), chapter VIII. 3]. □ 

6.4 Absolute continuity and convergence of the Lukasiewicz path 

By means of a discrete absolute continuity argument similar to the one used in Section 5, we shall 
show that for every a G (0, 1) the law of \j^W\r n t\ < t < a^j under P [ • | Wr n = —1] converges 
to the law of (X t br , < t < a). 

Lemma 6.10. Fix a G (0,1) and let n be a positive integer. To simplify notation, set W^ u > = 
(Wo, Wi, . . . , Wr,, ) for u > 0. For every function f : Uj>iZ l — > R + we have: 

E \f(W {an) ) I W T =-l]=E frty(™)) Xn -\- an l( WT ™ 

l Xn(0) 

where Xk{j) — Pj[Wr fe — — 1] f or every j G Z and k > 1, and W starts from j under the 
probability measure Fj. 

Proof. This follows from the strong Markov property for the random walk W. □ 
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Lemma 6.11. For every a > 0, we have lim sup 

\j\<aB n 



(j) - Pi 



B 



n/fj,Q 4 



0. 



Note that j is allowed to take negative values. Note also that Lemma 6.11 implies that 
Xn(0) ~ pi(0)/-B n/w) as n ->• oo. 



Proof. Fix e G (0, 1). Using Lemma, 6.2 (i), we have 



X „(j) = P,[^t„ = -1] = P [Wt„ = —j — 1] 



p 



Wr„ = -j - 1, 







T 


< en 







+ oei/ 2 (n) 



]T P [W fc = -j -l,T n = k] + oe 1/2 (n) 

\k—n/ fio\<en 

= (mo Yl P[W fc _i = -j,A(A;-l)=n-l] + oei /2 (n) 

|fc— n//io|<en 

Recall that S* n stands for the sum of n iid Bernoulli random variables of parameter fiQ and 
that W is the random walk W conditioned on having nonnegative jumps. By (|5]): 

P;[Wt„ = -1] 

= /i £ P ^-i = n - 1]P = n-j-l]+ oe 1/2 (n) 

\k—n/ iM>\<en 
rn/ /j +en+O(l) 

= A* / S[»-i| = n - 1 P [Wlj_ n = n - j - l] + oe 1/2 (n). 

Jn/fi()—en+0(l) 

Hq I du \fn P 



-ey/n+o(l) 

For \u\ < e-y/n, set: 



5 



n — 1 



P 



PUf ■ =n-7-l 



+ oei/ 2 (n) 



a n (w) = v^P 



Si „ _ i = n — 1 



Using the notation of Section 5.5.2, we have then: 



ev^+o(l) 

B n Fj[W Tn = -l]=n _ dua n (u)b n (u,j + 1) + oe 1/2 (n). 

eV^+o(l) 



The same argument that led us to (34) gives that: 



sup 

\j\<aB„ 



B' n f 3 [W Tn = -l]-c l l e Pl (- 



B 



n/n , 



The conclusion follows from that fact that B' n jB n i m — > c 1 ^ 9 . 



□ 



All the necessary ingredients have been gathered and we can now turn to the proof of the 
theorem. 



Proof of Theorem 6.1. Let F : D([0, a],R) — > IR+ be a bounded continuous function. Fix 
a G (0,1) and a > 0. To simplify notation, set A\l = {\W Tan \ < aB n/lM) } and G {n) (W) = 
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F ( B * Wyt Tan ^ ; < t < a). We apply Lemma 



and get: 



6.10 



E[G (n) (WOl A . \W Tn = - 
Lemma [6. Ill then entails: 

E[G^(W)l A?t \W Tn = -l] -E 



1 = E 



with f{W , W 1 ,..., W Tan ) = GW{W)1 A % 

Xn-[an]{W Tan 



G^(W)l A?i 



Xn(0) 



lim 



T 

- 1 an 



(n-[an})/n , 



From Lemma 6.2 (ii), we deduce that: 



lim E \G {n) (W)l Aa \W T 



1 = E 



:i - a) 



tj0<t<aJi{|X a |<a}- 



Pi(0) 



-Pi 



x n 



I -a) 



By (16), we have (1 — a) l ^ e p\ 
we conclude that: 



(l-o)V9 



Pi_ a (— X a ). Consequently, by Proposition 



6.9 



(43) 



Urn E | Wr n = -l] = E [F(X t br ; < t < a)l {lx ^ a}{ 

By taking F = 1, we obtain: 

lim lim P [A" I W T = -11 = 1. (44) 



By choosing a > sufficiently large, we easily deduce from the convergence (|43|) that: 
/ 1 



lim 

n— too 



E 



; < t < a Wt l = -1 



- E [F(X br ; < t < 



0. 



Next write: 



where K n = Bn/^/B^ and S n = aT n /T an . Lemma 6.2 (i) entails that K n and S n both converge 
in probability towards 1. Lemma 5.7 then implies that the law of (yj^rW\r n t\'i <t < aj under 
P [ ■ | Wx n = —1] converges to the law of (X^ T , < t < a), and this holds for every a G (0, 1). 

We now show that the latter convergence holds also for a = 1 by using a time-reversal 
argument based on Proposition 6.8, By the usual tightness criterion (see e.g. [4, Formula 
(13.8)]), it is sufficient to show that, for every r) > 0, 



Note that: 



lim lim sup P 

<5— >0 n— >oo 



sup 

se[i-5,l) 









sup 


r W lT n s} 
JDT 


> 1] 


_se[i-<S,i) 





w 



T 
J- n 



0. 



(45) 



W\ 



T 

J- n 



[T n s\ 



sup 

L(l-5)T n J<fc<T n -l 



B 



T. 

J- n 
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Using this remark, we write: 



P 



sup 

_SS[1-<5,1) 



1 



1 



■W[T n s} 



w 7 



F[W 7 



-1] 



/'o 



F[W T 



k=n 

oo 









1, sup 




> T) 


se[i-5,i) 





•!] fn 









i = 0, sup 




> T] 


se[i-5,i) 





< 



/'o 



P[W Tl 
P 



-VP 









i = 0, sup 




> 77 







sup 

se[0,<5+l/T n 



> 77 



W 7 



-1 



using Proposition 6.8 in the upper bound of the last display. (45) then follows from the fact that 
that the law of (ji^W\T n t} ] < t < a) under P [ • | W Tn = -1] converges to the law of (X t br , < 
t < a) for every a £ (0, 1). We conclude that this convergence also holds for a — 1. 

We then combine the continuous Vervaat transformation V with the latter convergence. Since 
V is almost surely continuous at X hv (Proposition 6.9 (i)), we get that the law of 

v(J_ w . <t < 1 

JDT 



under P[ - | Wr n = —1] converges to the law of V(X hv ). Corollary 6.7 and Proposition 6.9 
entail: 



B, 



1 

C(tn) 



W LC(tn)tJ (t n ); < t < 1 



n— >oo 



(X t ; < t < 1) under N( • | C = !)■ 



This completes the proof. 



□ 



7 Application: maximum degree in a Galton- Watson tree 
conditioned on having many leaves 

In this section, we study the asymptotic behavior of the distribution of the maximum degree in 
a Galton- Watson tree conditioned on having n leaves. To this end, we use tools introduced in 
Section 6 such as the Vervaat transformation and absolute continuity arguments. 

As earlier, we fix 9 £ (1,2] and suppose that fi is a probability distribution satisfying the 
hypothesis (Ho). For every n > 1 such that P^[A(r) = n] > 0, let also t n be a random tree 
distributed according to P M [- | A(r) — n]. If r £ T is a tree, let A(r) = max{fc u ; u £ r} be the 
maximum number of children of individuals of r. We are interested in the asymptotic behavior 

ofA(in). 

The case 1 < 9 < 2 easily follows from previous results. Indeed, let (B n ) n >\ be defined as 



before Lemma 1.11 Then, by Theorem 6.1 and Remark 5.10 




W Lf(WtJ (O;0 



< t < 1 



(«0. 



(46) 
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If Z e D([0, 1],R), let A(Z) be the largest jump of Z. Note that by construction, A(W(t n ))) 



A(t n ) — 1. Since A is a continuous functional on D([0, 1],R), (46) immediately gives that 

1/0 — — — 

fi A(X)/ B n converges in distribution towards A(X exc ), which is almost surely positive. 

However, in the case a 2 < oo, A(X exc ) = almost surely since X exc is continuous. It is 
natural to ask whether the suitably rescaled sequence A(t n ) converges to a non-degenerate limit. 
A similar question has been previously studied by Meir & Moon [26j when t n is distributed 
according to P^[ • | £(r) = n]. We shall make the same assumptions on /i as Meir & Moon. 

More precisely, let v be a critical aperiodic probability distribution on N with finite variance. 
Let R be the radius of convergence of v{i)z l . We say that v satisfies hypothesis "K if the 
following two conditions hold: R > 1 and if R < oo, u{n) 1 / n converges towards 1/R as n — > oo, 
if R = oo there exists N > such that the sequence (i / (fc) 1 ^ fe )fc>Ar is decreasing. 

Theorem 7.1. 

(%) Ifl<6< 2, we have fil /9 A{t n )/B n A(X exc ). 

(ii) Set D(n) = max{fc > 1; [i([k, oo)) > 1/n}. If a 2 < oo, under the additional assumption 
that n satisfies hypothesis "K, we have for every e > 0: 

F[(l-e)D(n)<A(Q<(l + e)D(n)} ^ 1. 

Part (i) of the theorem follows from the preceding discussion. It remains to prove (ii). We 
suppose that fi satisfies the assumptions in (ii). The first step is to control the asymptotic 
behavior of D(n). 

Lemma 7.2 (Meir & Moon). Let e > 0. For n sufficiently large: 

- e)D(n), oo)) > n~^, + e)D(n), oo)) < n"^ e/3 . 

Proof. See the proof of Theorem 1 in [26], which uses the different assumptions made on \i. □ 



Proof of Theorem 7.1 in the case a 2 < oo. The idea of the proof consists in showing that if the 
Lukasiewicz path of a non-conditioned Galton- Watson tree satisfies asymptotically some property 
which is invariant under cyclic-shift (with some additional monotonicity condition), then the 
Lukasiewicz path of a conditioned Galton- Watson tree satisfies asymptotically the same property. 



We first establish the lower bound. Recall the notation introduced in (40 ). If u = (u\, . . . , Uk) € 
Z fc , set M(u) = maxi<i< k Ui, so that A(t„) = M(W x (t n ) - W (t„), . . . , - WcM-ifa)) + 
1. Note that M is invariant under cyclic shift. Set p n = (1 — e)D(n). To simplify notation, for 
«!,... ,Uk e Z set F (n) (wi, ...,u k ) = l{M(wi,...,u fe )<Pn}- We have: 

P [A(t n ) < p n + 1] = E M [F^(W 1 (t) - W (r), W ?(r) (r) - W c{Tyi (r))) \ A(r) = n] 
= E [F( n )(V(^ 1; W 2 - Wi, . . . , W Tn - W Tn -x)) I W Tn = -l] 
= E [F^(W U W 2 -W 1 ,..., W Tn - Wt^x) I W Tn = -l] , (47) 



where we have used Proposition 6.5 in the first equality, and the fact that F^(V(u)) = F^(u) 
for every u e Z fc (k > 1) in the second one. To simplify notation, we put F^ n) (W) = F n (W h W 2 - 



W u ...,W k - W k ^). Note that E F^{W) \W Tn = -1 < E F^J (W) \ W Tn = -1 



^/2 ' 



In order to 
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7.1 



establish the lower bound in Theorem 

tends to as n — > oo. Let a > 0, and let the event A" be defined by 



li), it then suffices to prove that E \f^ /2 (W) \ W Tn = -1 



where a 2 is the variance of /i. By Lemma 6.2 (i), we have: 



E 



4%{W)\W Tn = -\ 



< E [1 {A . }C | W Tn 



E 



By Lemma [6. 10 
E 



Ft] (W)li A ° ^<t „<j±\ \W T = -1 



' 4 M0 



l/2 



E 



( W ') l {A^<T n/2 <^}\W 7 



+ oei/ 2 (n) 



p (n) Xn-[n/2\(W Tn/2 ] 



where Xn(j) — ^j[^T n = — 1]- By Lemma 6.11, there exists C > such that for every n large 
enough, Xn-\n/2\(WT n/2 )/Xn{fy < C on the event A". By combining the previous observations, 
we get: 



E 



F<t%(W)\W Tn = -l 



< E [l {A « y | W Tn = -1] + CE [F^ /2 (W)l { n< Tn/2 <^ } \ + oe 1/2 



n 



(4E 



By (44), we have: 



lim lim E [l/A<ne I Wt 



■1 



0. 



Let us finally show that the second term in the right-hand side of (48) tends to zero as well. We 
have: 



E 



F TZS w )h^<T n/2 <^ } \ < p [m(w 1 , w 2 -w 1 ,..., w [n/4floi - w Ln/4M0 j-i) < v, 

= F[Wx < p n ] Ln/4wJ = (1 - P[W X > p n }) [n/4lMo1 ■ 



The first part of Lemma 7.2, implies that the last quantity tends to as n — > oo. By combining 
the previous estimates, we conclude that P[(l — e)D(n) > A(t n )] — > as n — > oo. 

Let us now establish the upper bound. Set q n = (1 + t)D{n). By an argument similar to 
the one we used to establish we get P[A(t n ) > q n + 1] = P[M(Wi, . . . , W Tn - W Tn -\) > 

Qn | Wt„ — — 1]- It follows that: 

P[A(t n ) > q n + 1] < nM(W 1 ,W 2 -W 1 ,...,W Tn/2 -W Tn/2 ^)>q n \W Tn = -l] 

+nM(W Tn/2 - W Tn/2 -u ■ ■ ■ , W Tn - W Tn -i) > q n I W Tn = -1] 



By a time-reversal argument based on Proposition it is sufficient to show that the first term 
of the last expression tends to 0. To this end, we use the same approach as for the proof of the 
lower bound, taking this time Ff, (W) = l{M(Wi,...,w k -w k -i)>qn}- ^ * s then sufficient to verify 
that: 



E 



T-n/2 ' 



i w )k^<T n/2 <^} 



o. 
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To this end, write: 



E 



tl (W)l { ^ n/a ^ } \ < F[M(Wx, W 2 -W l7 ..., W [n/ , o} - W^^)) > q n ] 

= l-(l-nW 1 >q n ]) ln/t * i 



which tends to as n — > oo by Lemma |7.2| By combining the previous estimates, we conclude 
that P[(l + e)D(n) < A(t n )] — > as n — > oo. This completes the proof of the theorem. □ 

Remark 7.3. In particular cases, it is possible to obtain better bounds in the previous theorem. 
Let (j, be defined by /i(0) = 2 - y/2, = and = ((2 - v / 2)/2)^ 1 for i > 2 (this 

probability distribution appears when we consider the tree associated with a uniform dissection 
of the n-gon, see [9j). One verifies that \x is a critical probability measure. For n > 1, let t n be 
a random tree distributed according to P M [- | A(r) = n\. One easily checks that \x is the unique 
critical probability measure such that t n is distributed uniformly over the set of all rooted plane 
trees with n leaves such that no vertex has exactly one child. In this particular case, Theorem 



7.1 (ii) can be strengthened as follows: 

P[log, n - c\og b log, n < A(t n ) < log 6 n + clog fe log b n] 1, (49) 

shows that it is 



7.1 



for every c > 0, where b = l//i(2) = v^2 + 2. Indeed, the proof of Theorem 
sufficient to verify that for every c > 0: 

(1 - P[W l > \og b n- clog, log, n]) n/4M ^0, (1 - P[W X > \og b n + clog, log, n}) n/Afl ° ^ 1. 

These asymptotics are easily obtained since the probabilities appearing in these two expressions 
can be calculated explicitly. 

The convergence (49) yields an interesting application to the maximum face degree in a 
uniform dissection (see P, Prop. 3.5]). 



8 Extensions 

Recall that if A is a non-empty subset of N and r a tree, Ca( t ) is the total number of vertices 
u G t such that k u (r) G A. For a forest f, Oi.(f) is defined in a similar way. In this section, 
we extend the results (I) and (II) appearing in the Introduction to the case where A ^ {0}. 
By slightly adapting the previous techniques, it is possible to obtain the following more general 
result. 

Recall that fi is a probability distribution on N satisfying the hypothesis (Hg) for some 

9 G (1,2]. We also consider the slowly varying function h and the sequence (-B n )n>i introduced 



just before Lemma 1.11 



Theorem 8.1. Let A be a non-empty subset ofN. If fi has infinite variance, suppose in addition 
that either A is finite, or N\A is finite. 

(I) Let d > 1 be the largest integer such that there exists b G N such that supp(fi)\A is contained 
in b + dZ*, where supp(fi) is the support of fi. Then : 

p,[cm=»i ^ M^v(o) 8 ^-;,;f 

for those values of n such that P M [Ca(t) = n] > 0. 
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(II) For every n > 1 such that P^ [(a( t ) — n] > 0, let t n be a random tree distributed according 



toP M [-|Cyi(r) 




w LC(tn)t](t 



5, 



C(tn) 



, 5 C(tn) 

converges in distribution to (X 



' C(U) 



C2C(t„)t(tn) 



5, 



C(U) 



C(tn 

as n — ?■ oo. 



0<t<l 



Theorem |8. 1| can be established by the same arguments used to prove Theorems |3.1 , 5.9 and 

(ii), which 



6.1 The main difference comes from the proof of the needed extension of Lemma 5.4 



is more technical. Let us explain the argument leading to the convergence 

3 



lim 



sup 

l<j<aB„ 



nP At ,[Ci(f)=n]-g 1 



0. 



(50) 



The first step is to generalize Proposition 1.6 and find the joint law of (C(f), d(f)) under P 



(which is the contents of the latter proposition in the case A = {0}). To this end, let p and p! 
be the two probability measures on NU { — 1} defined by: 




if i + 1 e A 
otherwise 



//(<) 




It is then straightforward to adapt Proposition |1 ,6| and get that: 

J 



p M -[c(f)=p,a(f) 



n\ 



P 



■F[S p = n}F[W' 



p—n 



otherwise. 

-u n -j}. 



where S p is the sum of p independent Bernoulli random variables of parameter fi(A), (W^) n >i is 
the random walk started from with jump distribution p! and {U n ) n >\ is an independent random 
walk started from with jump distribution p. Note that — U n = n when A = {0}. 

First suppose that p has finite variance. Then both W[ and U\ have finite variance. As in 
the proof of Lemma [5. 4[ we have, for < e < 1: 

nP« Mf) 



n 



n 



dx/^F [S [xl =n]F[WL l _ n 



-U n - j] + oe 1/2 (n). (51) 



By the law of large numbers, we can suppose that for n sufficiently large, P [\U n — nE \U\] \ > en] < 
e. Set t n (v) = [nE [Ui] + v \/n\ for n > 1 and It follows that: 



P [w[ : 



x\ —n 



-u„ 



J 



-ey/n+o(l) 



x\ —n 



t n (v)-j\F[U n = t n (v)] 



< e. 



The local limit theorems give bounds and estimates for the quantities P [W^'zJ-n = ~tn{v) — j] 
and P [U n = t n (v)}. As previously, we can then use the dominated convergence theorem to obtain 
an estimate of P [Vt /r ui_ n = —U n — j] as n — > oo. We substitute this estimate in (51) and using 



once again the dominated convergence theorem we obtain (50). 

Now suppose that p has infinite variance and that A is finite. Then W[ is in the domain 
of attraction of a stable law of index 9 and Ui has bounded support hence finite variance. The 
proof of (50) then goes along the same lines as in the finite variance case. 

When p has infinite variance and N\A is finite, W[ has finite variance and U\ is in the domain 
of attraction of a stable law of index 9. The proof of (50) goes along the same lines as when 
p has finite variance by interchanging the roles of W and of U (see (20] for details in the case 
A = N). 
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